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SULFATED CCR5 PEPTIDES FOR HIV- 1 INFECTION 
5 This application is a continuation-in-part of and claims 
the benefit of U.S. Provisional Application No. 60/267,231, 
filed February 7, 2001, U.S. Provisional Application No. 
60/205, 83S, filed May 19, 2000 and U.S. Provisional 
. Application No. 60/185,667, filed February 29, 2000, the 
10 contents of which are hereby incorporated by reference into 
this application. 

The invention disclosed herein was made with Government 
support under NIH Grant Nos . R01A143 847 (T.D.) and 
15 R01DK54718 (T.P.S.) from the Department of Health and Human 
Services, Accordingly, the government has certain rights in 
this invention. 

Throughout this application, ■ various publications are 
20 referenced within parentheses. Disclosures ' of these 
publications in their entireties are hereby incorporated '-by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 
bibliographic citations for these references may be found 
25 immediately preceding the claims. 

Background of the Invention 

HIV-1 entry into target cells is mediated by the successive 
interaction of the envelope glycoprotein gpl20 with CD4 and 
30 a co-receptor belonging to the seven trans -membrane G 
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protein-coupled chemokine receptor family (Berger et al . 
Ann. Rev. Immunol. 17:657, 1999). Binding of gpl20 to CD4 
exposes or creates a co-receptor binding site on gpl2 0 
(Trkola et al . Nature 384:184, 1996, Wu et al . Nature, 
5 384:179, 1996). CCR5 and CXCR4 are the most physioloqically 
relevant and widely used HIV-l co-receptors (Zhang and 
Moore, J. Virol. 73:3443, 1999). CCR5 mediates the entry of 
R5 isolates and CXCR4 mediates the entry of X4 isolates. 
R5X4 isolates are able to exploit both co-receptors (Berger 

10 et al . Ann. Rev. Immunol. 17:657, 1999). It has been 
demonstrated that specific amino acids including acidic 
residues and tyrosines located within the CCR5 amino- 
term.inal domain (Nt, amino acids 2-31) are essential for 
CCR5-mediated fusion and entry of R5 and R5X4 HIV-1 strains 

15 (Dragic et al . J. Virol. 72:279, 1998 ; Rabut et al . J. 
Virol. 72:3464, 1998; Farzan et al . J, Virol, 72:1160, 
1998; Dorantz et a J . J. Vii'ol . 71:6305, 1997). More 
recently, Farzan et al . demonstrated that tyrosine residues 
in the CCR5 Nt are sulfated (Farzan et al . Cell 96:667, 
20 1999) 

Inhibition of cellular sulfation pathways, including 
tyrosine sulfation, by sodium chlorate decreased the 

25 binding of a gpl20/CD4 complex to CCR5* cells (Farzan et al . 
Cell 96:667, 1999). A number of prior reports had 
implicated a role fox' sulfate moieties in HIV-l entry. 
Several sulfated compounds, such as dextran sulfate, can 
inhibit HIV-1 entry by associating with CD4 or gpl20 

30 (Baeuerle and Huttner J. Cell Biol 105:2655, 1987; Baba et 
al . Proc. Natl. Acad. Sci . USA 85:6132, 1998). Sulfated 
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proteoglycans have been shewn to bind to HIV-1 gpl20 at or 
near its third variable (V3) loop, which also determines 
co-receptor usage (Roderiguez et al . J. Virol. 69:2233, 
1995; Kwang et al . Science 253 : 71 1991} . It is therefore 
5 conceivable that sulfo- tyrosines in the CCR5 Nt also 
interact with gpl20, increasing its affinity for CCR5 . The 
reduction in gpl20/CD4 binding caused by the pre- treatment 
of target cells with sodium chlorate, however, cannot be 
formally attributed to a reduction in CCR5 tyrosine 
10 sulfation since chlorate can inhibit the sulfation of both 
tyrosines and proteoglycans. 

The region of the CCR5 Nt spanning amino acids 2-18 
'contains residues that are critically important for viral 

15 entry (Dragic et al . J. Virol. 72:279, 1998; Rabut et al . 
J. Virol. 72:3464, 1998; Farzan et al . J. Virol. 72:1160, 
1998; Dorantz et al . J. Virol. 71:6305, 1997). We 

previously demonstrated that tyrosines at positions 3, 10 
and 14 were required for optimal co-receptor function, 

20 whereas the Tyrl5Fhe substitution had little effect on 
entry (Rabut et al . J. Virol. 72:3464, 1998). Taken 
together, these findings suggested that KIV-1 entry, may ''be 
critically dependent upon sulfation of Tyr-3, -10 and -14, 
but not Tyr-15. We therefore explored the role of sulfo- 

25 tyrosines in positions 2 , 10 and 14 by synthesizing 
peptides corresponding to amino acids 2-18 of the CCR5 Nt 
and carrying different tyrosine modifications. We first 
tested the ability of the Nt peptides to inhibit binding of 
gpl20/CD4 complexes and anti-CCR5 MAbs to CCRS'' cells. The 

30 specific association of certain peptides with gpl20/sCD4 
complexes or with , ant i - CCR5 MAbs vv^as further confirmed by 



surface plasmon resonance (EIAcore) analysis. Inhibition of 
KIV-1 entry by the CCR5 Nt peptides was also tested. Our 
results suggest that amino acids 2-18 of the CCR5 Nt 
compose a gpl20-bindinc site ' that determines the 
specificity of the interaction between CCR5 and gpl20s from 
R5 and R5X4 isolates. Post - translat ional sulfation of the 
tyrosine residues in the CCR5 Nt is required for gpl2G 
binding and miay critically modulate the susceptibility of 
target cells to HIV-1 infection in vivo. 

CCR5 ' s normal physiologic activities involve binding and 
transducing signals mediated by CC- chemokines , including 
RANTES , MlP-la and MIP-1&, which direct activation and 
trafficking of T cells and other inflammatory cells. As 
such, CCR5 plays an important role in mediating the 
inflammatory reaction of diseases such as rheumatoid 
arthritis and multiple sclerosis. The synovial fluid of 
rheumatoid arthritis patients is highly enriched in CCR5- 
expressing T cells (Qin et al , J Clin Invest 101:746, 
1998) ; and CCR5 is the predominant CC chemokine receptor- 
expressed on T cells in the rheumatoid synovium (Gomez.- 
Reino et al . Arthritis Rheum 42:989, 1999). Similarly, 
infiltration by CCR5-expressing cells is characteristic of 
plaque lesions in patients with multiple schlerosis 
(Balashov et al , Proc Natl Acad Sci USA 96:6873, 1999). 
Such observations provide a rationale for the use of agents 
that block CCR5 for therapy of inf lammiatory/autoimmune 
diseases, including but not limited to arthritis, multiple 
sclerosis, asthmia, psoriasis, autoimmune diabetes, 
transplant rejection, and atherosclerosis. 
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Summary of the Invention 

This invention provides e compound comprising the 
structure : 

eaYDINYYTSE3A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an iscleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein (5 
represents from 0 to 13 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 
peptide bonds in consecutive order and have a sequence 
identical to the sequence set fox-th in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; wherein 0 represents an 
amino group or an acetylated amino group; wherein X 
represents a carboxyl group or an amidated carboxyl group; 
wherein all of a, Y , D, I , N , Y , Y , T , S , E and (3 are joined 
together by peptide bonds; further provided that at least 
two tyrosines in the compound ai-e sulfated. 

This invention also provides a compound comprising the 
structure : 

6aYDINYYTSE3A 

wherein each T represents a threonine, each S represents a 



sei-ine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine;^ and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide ocncs in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 heginnlng with the I at position 9 and extending 
therefrom in the amino terminal diorection; wherein 3 
represents from 0 to 333 amino acids, with the proviso that 
if there are more than 2 amino' acids, they are joined by 
peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
group; wherein X represents a carbcxyl group or an amidated 
carbcxyl group; wherein all of a, Y, D, I , N, Y, Y, T, S , E and 3 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

This invention provides a composition which compx-ises a 
carrier and an amount of one of the compounds described 
herein effective to inhibit binding of HIV-1 to a CCR5 
receptor on the surface of a CD4+ cell. 

This invention provides a m.ethod of inhibiting human 
immunodeficiency virus infection of a CD4+ cell which also 
carries a CCR5 receptor on its surface which comprises 
contacting the CD4+ cell with an amount of one of the 



compounds described herein effective to inhibit binding of 
human immunodeficiency virus to the CCR5 receptor so as to 
thereby inhibit human immunodeficiency virus infection of 
the CD4+ cell. 

This invention provides a method of preventing CD4-f cells 
ci a subject fromi becoming infected with human 
immunodeficiency virus which comtprises administex-ing to the 
subject an amount of one of the compounds described herein 
effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4+ cells so 
as to thereby prevent the subject's CD4+ cells from 
becoming infected with humian immunodeficiency virus. 

This invention provides a method of treating a subject 
whose CD4-i- cells are infected with human immunodeficiency 
virus which comiprises administering to the subject an 
amount of one of the compounds described herein effective 
to inhibit binding of humian immunodeficiency virus to CCR5 
receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject, . . 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 licand to a CCR5 .receptor 
which comprises: 

(a) immiobilizing one of the compounds described herein on 
a solid support ; 

(b) contacting the imimobilized compound from step (a) with 
sufficient detectable CCR5 ligand to saturate all 
binding sires for the CCR5 ligand on the imimcbilized 
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compound under conditions permitting binding of the 
CCR5 ligand to the immobili zed . compound so as to form 
a complex; 
(c) removing any unbound CCR5 ligand; 

(c; contacting the complex from step (b) with the agent; 
and 

(e) detecting whether any CCR5 ligand is displaced from 
the complex, w^herein displacement of detectable CCR5 
ligand from the complex indicates that the agent binds 
to the compound so as to thereby identify the agent as 
one which inhibits binding' of the CCR5 ligand to the 
CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptox' 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions perm.itting binding of the CCR5 ligand 
to the compound so as to form a complex; 

(b; removing any unbound CCR5 ligand; 

(c) measui'ing the amount of CCR5 ligand which is bound to 
the compound in the complex; 

(d) contacting the complex from step (a) with the agent so 
as to displace CCk5 ligand from the complex; 

(e) measuring the amount of CCR5 ligand which is bound to 
the compound in the presence of the agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) wHth the amount measured in ster 



(c) ; wherein a reduced amount measured in step (e) 
indicates that the agent binds "to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCRS ligand to -the CCR5 receptor. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCRS 
receptor which comprises; 

(a) immobilizing one of the compounds described herein on 
a solid support; 

(b) contacting the immobilized compound from step (a) with 
the agent and sufficient detectable CCRS ligand to 
saturate all binding sites for the CCRS ligand on the 
compound under conditions permitting binding of the 
CCRS ligand to the immobilized compound so as to form 
a complex; 

(c) removing any unbound CCRS ligand; 

(d) measuring the amount of detectable CCRS ligand which 
is bound to the immobilized compound in the complex; 

(e) measuring the amount of detectable CCRS ligand which 
binds to the imimobilized compound in the absence^, of 
the agent; 

(f) comparing the amount of CCRS ligand which is bound to 
the imimobilized compound in step (e) with the amount 
measured in step (d) , wherein a reduced am.ount 
mieasured in step (d) indicates that the agent binds tc 
the compound so as to thereby identify the agent as 
one which inhibits binding of the CCRS ligand to the 
CCRS receptor. 



This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
x'eceptor which comprises: 

(a) contacting one of the compounds described herein with 
the agent and sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on the 
compound under conditions permitting binding of the 
CCR5 ligand to the compound so as to form a complex; 

(b) removing any unbound CCR5 ligand ; 

(c) mieasuring the amount of detectable CCR5 ligand which 
is bound to the compound in' the complex; 

(d) measuring the amount of detectable CCR5 ligand which 
binds to the compound in the absence of the agent; 

(e) comiparing the amount of CCR5 ligand which is bound to 
the' compound in step (c) with the amount measured in 
step (d) , wherein a reduced amount measured in step 
(c) indicates that the agent binds to the compound so 
as to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

a; imimobilizinc one of the compounds desci'ibed 

herein on a solid support; 
b) contacting the immobilized compound from step a) 
with the agent dissolved or suspended in a known 
vehicle and measuring the binding sipnal 
generated by such contact; 
c; contacting the xmmiobilized ccmipcund from step a; 
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with the known vehicle in the absence of the 
compound and measuring the binding signal 
generated by such contact; 
,d) comparing the binding signal measured in step b) 
with the . binding signal measured in step c) , 
wherein an increased amount measured in step b) 
indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

This invention provides a method of obtaining a composition 
which comprises: 

(a) identifying a compound which inhibits binding of a 
CCR5 ligand to a CCR5 receptor according to one of the 
above methods; and 

(b) admtixing the compound so identified or a homolog or 
derivative thereof with a carrier. 

This invention provides a compound having the structure: 
A - (aYDINYYTSEpX)^ 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D repi'esents an aspartic acid, 
each 1 represents an isoieucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position S 
and extending therefrom in the amino terminal direction; 
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wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 atnino acids, they are 
joined together by peptide bonds, in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with tne P at position 19 and extendino 
therefrom in the carboxy ter-minal dii-ection; wherein a 
represents a carboxyl group or an ami da ted carboxyl group; 
wherein all of a, Y, D, I , N, Y, Y, T, S , E and p are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

This invention also provides a compound having the 
structure : 

( eaYDINYYTSEp ) „ - A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents" an 
asparagine; wherein a i-epresents from 0 to 9 amino acids, 
with the proviso that if there are m.ore than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino termiinal direction; 
wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 



have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy termii^^al direction; wherein 6 
represents, an amino group or an -acetylated amino group; 
wherein all of a , Y , D , I , Y , Y , T , S , E and B are joined 
together by peptide honds , further provided that at leasL 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to S , A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

This invention provides a compound having the structure: 

A- (aYDINYYTSEPX) J, 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the 1 at position 9 
and extending therefrom in the amino terminal direction; 
wherein (i represents from 0 to 333 amino acids, with the 
proviso that if there are more than 2 amino acids ; they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein a 
represents a carboxyl group or an amidated carboxyl group; 
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whei-ein all of a , Y , D , I , N, Y , Y , T , S , E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, wherein n i"S an integer from 1 to S, 
L as a polymer, and the solid line represents up to & 
linkers which attach the structure in parentheses to A. 

This invention also provides a compound having the 
structure : 

( eaYDINYYTSEp ) ^- i\ 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoieucine; and each N represents an 
asparagine; wherein a repx'esents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
cx'der and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position S 
and extending therefrom in the am.ino terminal direction; 
whei*ein 3 repi'esents from 0 to 333 amino acids, with the 
proviso that if there are irtcre than 2 amiino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical ro the sequence set forth in SEQ 
ID NO: 1 beginning with the ? at position 19 and extending 
therefrom in the carboxy termiinal direction; wherein 6 
represents an amino group or an acetylated amino group; 
V7herein all of a , Y, D , I , N, Y, Y , T , £ , E and p are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the ccmipound are sulfated, wherein n is an 
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inteQer from 1 co 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 
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Brlef Description of the Figures 



Fig. I Effect of peptides on gpl20j3i.-L binding to CCR5 . 

L1.2-CCR5'* eel IF were** incubated with the 
-"^ biotinylated gpl2 G jr_pl/CD4 - IgG2 complex in the 

presence of difiereni: concencrat ion of peptides 
(a) S-3/10/14, £-10/14, S-10, S-14 or (b) P- 
3/10/14, P-10/14, SR-2/12, SR-10/14, TS-10/14. 
The exuent of complex binding in the absence of 
10 peptide was defined as 100% (m.f .i. -40 + 5) . 

Binding in the presence of peptide is expressed 
as a percentage of control. When CCR5 -negative 
cells were used, binding of the gpl2 Oj^.fl/CD4 - IgG2 
complex was negligible (-10%, m.f .i. -2±1) . The 
15 values shown are from a representative 

experiment . 



Fig. 2 Binding of the gpl20/sCD4 complex to sulfated and 

phosphorylated peptides . 

20 Biotinylated peptides were immobilized on a 

sensor chip and their ability to associate with 
gpl20/sCD4 was analyzed by BIAcore . RU values as 
a function of time were mieasured in the absence 
of peptide (gray dotted lines) , in the presence 

25 of phosphorylated peptide (black dotted lines) or 

xn the presence ot sulfated peptide (solid black 
lines) . We performed binding analyses with the 
following proteins: (a) gpl2 0^.i,.pL/sCD4 , (b) gpl20jp.. 
PL, (c) sCD4 , (d) DV3gpl2Cji,_pL/sCD4 , (e) 

30 9P12 0dh.23/sCD4 , (f) gpl2 0oHi23. (9) gpl 2 0^^^, / sCD4 and 
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(h) gpl20^i. 

Effect of peptides on MAb binding to CCR5 . 
hi .2-CCR5^ cells were incubated with the anti-CCRS 
MAbs in the presence of peptides. The extent of 
MAb binding in the absence of peptide v^as defined 
as 100% (m.f.i. -50-400, depending on the MAb). 
Binding in the presence of peptide is expressed 
as a percentage of control. Vvhen CCR5 -negative 
cells were -used; binding of MAbs was negligible 
(m.f.i. -2±1) . Each data point represents the 
mean ± s.d. of three replicates. 

Binding of KAbs to sulfated and phosphorylat ed 
peptides . 

Biotinylated peptides were immobilized on a 
sensor- chip and their ability to associate with 
anti-CCRS MA.bs was analyzed by BIAcore . RU values 
as a function of time were measured in the 
^hsetnce of peptide (gray dotted lines) , in the 
presence of phosphorylat ed peptide (black dotted 
lines) or in the pi'esence of sulfated peptide 
(solid black lines) , VJe performed binding 
analyses with (a) PA,S , (b) PAIO and (c) 2D7 . 

Effect of peptides on viral entry. 

HeLa-CD4"CCR5'' cells were infected with Nlluc^env" 
pseudotyp^ed with different viral envelopes in the 
presence of peptides. Lucif erase activity 
(r.l.u.) was mesured 48 h post - infect ion . The 
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extent of entry in the absence of peptide was 
defined as 100% (r.l.u.. -25,000± 9,000). 
Background r.l.u. values were -7±2 . Each data 
point represents the mean + s.d, of three 



Fig. 6 CCR5 Nt peptide sequences and labels 

The primary sequence of each peptide is indicated 
in the left column and the corresponding label is 
10 indicated in the right column. Sulfated tyrosine 

residues are designated by black boxes and white 
boxes designate phosphorylated tyrosine residues. 

Fig. 7 Gpl2 0/CD4 complex binding to CCR5 Nt 

15 sulf opeptides 

Peptide 2-18 was bound to streptavidin-coated 
biosensor chips and gpl2 0j3^.fl/sCD4 (dotted line) 
or gpl2 Ojr.pl/CD4 -IgGs (solid line) were flowed over 
the sensor chip surface. Resonance units (RU) 

20 were measured as a function of time using a 

Biacore X and reflect complex-peptide binding 
(a) . Sulfopeptide 2-18 (solid symbols) or 
phosphopeptide 2-18 (P) (clear symbols) were 
immobilized on streptavidin-coated ELISA plates 

25 and incubated with gpl20/CD4-IgG2 complexes. Gpl20 

proteins were derived from the R5 isolate JR-FL 
(squares), the R5X4 isolate DH123 (circles) and 
the X4 isolate LAI (diamonds) . Complexes-peptide 
binding was detected by an HRP- conjugated goat 

30 anti-human IgG antibody. CD. at 450 nm was 



measured after addition of KRP substrate and is 
expressed as a function of CD4-IgG2 concentration 
(b) . Biotinylated &ulfopeptide 2-18 was 
immobilized on streptavidin-coated plates and 
incubated with gpl2 0/CD4 - IgG;. complex in the 
presence of increasing concentrations of: PA 8 
(solid squares) , TAK-779 (triangles) , Rantes 
(inverted triangles) , MIP-1 (diamonds) , MIP-1 
(circles) or SDF-1 (clear squares) . Binding of 
the complexes to the peptide was detected by 
incubation with HRP-conjugated goat anti-human 
IgG antibody. 0,D. at 450 nm was measured after 
addition of KRP substrate and percentage of 
binding was expressed as a function of inhibitor 
concentration . 

Binding of anti-CCR5 MAbs to CCR5 Nt peptides. 

Sulf opeptides (a) or phcsphopept ides (b) were 
immobilized on streptavidin-coated ELISA plates 
and incubated with anti-CCR5 MAbs PAS (solid 
squares) , PAIO (clear circles) , PAll (solid 
circles) , PA.12 (solid diamonds) or PA14 (solid 
triangles) . Binding of the MAbs to the peptides 
was detected by an HRP-conjugated goat anti-mouse 
IgG antibody. O.D. at 450 nm was m.easured after 
addition of HR? substrate and expressed as a 
function of lyiAb concentration. 

Binding of gpl2 Oj,^.fi,/CD4 - IgGj to different CCR5 Nt- 
based peptides. 
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Streptavidin plates were coated with 2-18 (black 
squares) , 10-18 (black circles) , 8-15 (black 
diamonds), 6-16 (black stars) ^ 10-15 (white 
square), 10-18 (IIA/IBA) (black triangles). Plates 
5 were then incubated with gpl2 Ojr.pl/CD4 - IgGj 

complex. Binding of the complex to the peptide 
was detected by an HRP- conj ugated goat anti-human 
IgG antibody. O.D. at 4 50 nm was measured aftei* 
additicn of HRP substrate and expressed as a 
10 function of CD4-lgG2 concentration (nM) . 

Fig, 10: Inhibition of gpl2 0/CD4 - IgGj complex binding to 
sulf o-peptides by anti-gpl20 MAbs 

Biotinylated ■ sulf opeptide 2-18 was bound to 
streptavidin-coated biosensor chips and solutions 
of either gpl2 0jr.pl/CD4 - IgG2 complex (black bars) 
/CD4-IgG2 complex (white bars) were 
flowed over the surface of the chip in the 
presence of different anti-gpl20 MAbs. The names 
of the MAbs and the location of their epitopes 
are indicated along the x-axis. Resonance uijiits 
(RU) were measured as a function of time' using a 
Biacore X and reflect complex-peptide binding in 
the presence of the MAbs. Gpl2 0/CD4-IgG2 binding 
was calculated using the formula: (RU in the 
presence of lyiAbs) / (RU in the absence of MAbs) 
xlOC%. The values shown are from a sample 
experiment . 
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Fig. 11: Binding of gpl20 mutants to sul fo -peptide and 
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Fig. 12: 
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wild type CCR5 . 

Sulf o-pept ide 2-18 was immobilised on 

streptavidin-coated plates and incubated with a 
mixture of gpl20-containing supernatants and CD4- 
IgG^. Pept ids-complex binding was detected by an 
HRP-conjugated goat anti-human IgG antibody. O.D. 
at 450 nm was measured after addition of HRP 
substrate and normialized for binding of the gpl20 
mutants to CD4-IgG2. The doted line represents the 
normalized value for the binding of the wild type 
gpl2 0 to the peptide. The miutated amino acids and 
their locations in gpl20 are indicated along the 
X-axis (a) . L12 -CCPs-B-*- cells were incubated with a 
mixture of gpl20-containing supernatants and CD4 - 
IgGj. Binding of the complex was detected by FACS 
analysis after addition of strept avidin- PE . 
Percentage of gpl2 0/CD4-IgG2 binding to CCR5 was 
normalized for gpl2 0 binding to CD4-IgG2. The 
doted line represents the normalized value for 
the binding of wild-type gpl20 to the L12-CCR5* 
cells. The mutated amino acids and their 
locations in gpl20 are indicated along the x-axis 
(b) . 



Amino acid sequences- of CCR5 Nt-based peptides. 
The peptides are named according to the positions 
of their first and last residues in the full- 
length sequence of CCR5 . They contain either 
sulf otyrosines (black boxes) or phosphotyrosines 



{ 



-22- 

(white boxes) n positions 10 and 14. Residues 
Asp-11 and Glu-18 in peptide 10-18 (11A/18A) are 
substituted for alanines. All peptides carry a 
carboxy terminal GAG • spacer followed by a 
5 biotinylated lysine. 



Fig. 13: Amino acid conservation among R5 isolates. 

Envelope sequences from 25 R5 strains described 
in the HIV Database and retrieved from the 

10 National Center for Biotechnology Information 

GenBank were aligned and percentage of 
conservation for the indicated residues was 
calculated and combined with results from Hung et 
al . , 1999 (REF) . Alanine mutants showing more 

15 than 50 % decrease in sulfopeptide 2-18 binding 

compared to the wild type are highlighted in 
gray. 
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Detailed Description of the Inve^Tition 

The plasmids CD4-IgG2-HC-pRcCMV and CD4 -kLC-pRcCMV were 
deposited pursuant to, and in satisfaction of, the 
requir-ements of the Budapest Treaty on the International 
5 Recocrnition of the Deposit of Microorganisms (the "Budapest 
Treaty") for the Purposes of Patent Procedure with the 
American Type Culture Collection (ATCC) , 10801 University 
Boulevard, Manassas, Virginia 20110-2209 under ATCC 
Accession Nos . 75193 and 75194, respectively. 

10 

The plasmids designated PPI4 - tPA-gpl20j^,pL and PPI4-tPA- 
gpl20LAi were deposited pursuant to, and in satisfaction of, 
the requirements of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms 

15 for the Purposes of Patent Procedure with the American Type 
Culture Collection (ATCC) , 10801 University Boulevard, 
Manassas, Virginia 20110-2209 under ATCC Accession Nos, 
75431 and 75432, respectively. These plasmids were 

deposited with ATCC on March 12, 1993. These eukaryotic 

20 shuttle vectors contain the cytomegalovirus major 
immediate-early (CMV MIE) promoter/enhancer linked to ...the 
full-length HIV-1 envelope gene whose signal sequence was 
replaced with that derived from tissue plasminogen 
activator. In the vector, a stop codon has been placed at 

25 the gpl20 C-terminus to prevent translation of gp41 
sequences, which are present in the vector. The vector 
also contains an ampicillin resistance gene, an SV40 origin 
of replication and a DKFR gene whose transcription is 
driven by the (B-globin promoter. 
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The monoclonal antibodies PA8 , PAIO, PAll, PA12, and PA14 
were deposited pursuant to and in satisfaction of, the 
requirements of the Budapest Tr.eaty on the International 
Recognition of the Deposit of 'Microorganisms for the 
5 Purposes of Patent Procedure with the American Type Culture 
Collection (ATCC) , 10801 University Boulevard, Manassas, 
Virginia 20110-2209 on December 2, 1998 under the following 
Accession Nos . : ATCC Accession No. KB-12605 (FAB), ATCC 
Accession No.HB-12607 (FAIO) , ATCC Accession No. HB-12608 
10 (PAID, ATCC Accession No. HB-12609 (PA12), and ATCC 
Accession No. HB-12610 (PA14) , 



As used herein, the following standard abbreviations are 
used throughout the specification to indicate specific 
15 amino acids: 



A=ala= alanine 
N=asn=asparagine 
C-cys= cysteine 

20 E=glu=glutamic acid 

H=his=histidine 
L=leu= leucine 
M=met=methionine 
P=pro=proline 

25 T=thr=: threonine 

Y=tyr= tyrosine 
B=asx=asparagine or a 
Z=glx=glutamine or gl" 



R-arg=arginine 
D=asp=aspartic acid 
Q=gln=glut amine 
G=gly=glycine 
I=ile=isoleucine 
K=lys= lysine 
F=phe=phenyl alanine 
S=ser=serine 
W=trp=tr*yptophan 
V=val =val ine 

artic acid 

amic acid 
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used throughout the specification to indicate specific 

nucleotides : C=cytosine ; A=adenosine; T=thymidine; 
G= guanosine; and U=uracil . 

5 This invention provides a compound comprising the 
structure; 

GaYDINYYTSEgX 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 

10 represents a tyrosine; each D represents an aspartic acid, 
each 1 represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 ,to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order and 

15 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein 3 
represents from 0 to 13 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 

20 peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO.: - 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
25 group; wherein A represents a carboxyl group or an amidated 
carbcxyl group; wherein all of a,Y,D,I,N,Y,Y,T,S,E and 3 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

30 In one embodiment of the above compound, the compound is 
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peptide which comprises consecutive amino acids having the 
sequence YDINYYTSE. 

In one embodiment of the above compound, the tyrosines at 
5 positions 1 and 5 of the sequence YDINYYTSE are sulfated. 

In one embodim.ent of the above compound, a represents less 
than 9 amino acids. In another embodiment of the above 
compound, a represents less than 8 amino acids. In another 

10 embodiment of the above compound, a represents less than 7 
amino acids. In another embodiment of the above compound, a 
represents less than 6 amino acids. In another embodiment 
of the above compound, a repx'esents less than 5 amino 
acids. In another embodiment of the above compound, a 

15 represents less than 4 amino acids. In another embodiment 
of the above compound, a represents less than 3 amino 
acids. In another embodiment of the above compound, a 
represents less than 2 amino acids. In another embodiment 
of the above compound, a represents less than 1 amino acid. 

20 

In one embodiment of the above compound, |3 represents less 
than 17 amino acids. In one embodiment of the above 
compound, (3 represents less than 16 amino acids. In one 
embodiment of the above compound, 3 represents less than 15 

25 amino acids. In one embodiment of the above com.pound, 3 
represents less than 14 amano acids. In one embodiment of 
the above compound, 3 represents less than 13 amino acids. 
In one emibodiment of the above compound, 3 represents less 
than 12 amino acids. In one embodiment of the above 

30 compound, 3 represents less than 11 amino acids. In one 
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embodiment of the above compound, (3 represents less than 10 
amino acids. In one embodiment of the above compound, 3 
represents less than 9 amino acids. In one embodiment of 
the above compound, 3 repz^esents less than 8 amino acids. 
5 ■ In one embodiment of the above compound, P represents less 
than 7 amino acids. In one embodiment of the above 
compound, 3 represents less than. 6 amino acids. In one 
embodiment of the above compound, 3 represents less than 5 
amino acids. In one embodiment of the above compound, 3 
10 represents less than 4 amino acids. In one embodimient of 
the above compound, 3 represents less than 3 amino acids. 
In one embodiment of the above compound, 3 represents less 
than 2 aniino acids . In one embodiment of the above 
compound, 3 represents less than 1 amino acid. 

15 

This invention also provides a compound comprising the 
structure : 

eaYDINYYTSESA 

wherein each T represents a threonine, each S represents a 
20 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; whex^ein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
25 they are joined by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino term.inal direction; wherein 3 
represents from 0 to 333 amino acids, with the proviso that 
30 if there are more than 2 amino acids, they are joined by 



peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO; 1 
beginning with the P at position. IS and extending therefrom 
in the carboxy terminal direction;' 
5 wherein 9 represents an amino group or an acetylated amino 
group; wherein X represents a carboxyl group or an amidated 
carboxyl group; wherein all of a , Y , D, I , N, Y , Y, T , S , E and P 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

10 

In the compounds described herein and as' exemplified above, 
the (3 in each compound may alternatively represent from 0 
to 333 amino acids. 

15 In one embodiment of the compounds described herein, 3 
represents less than 300 amino acids. In another embodiment 
of the above compound, 3 represents less than 2 50 amino 
acids. In another embodiment of the above compound, 3 
represents less than 200 amino acids. In another embodiment 

20 of the above compound, 3 represents less than 150 amino 
acids . In another embodiment of the above compound., . 3 
represents less than 100 amino acids. In another embodiment 
of the above compound, 3 represents less than 75 amino 
acids. In another embodiment of the above compound, 3 

25 represents less than 50 amiino acids. In another embodiment 
of the above compound, 3 represents less than 4 0 amino 
acids. In another embodiment of the above compound, 3 
represents less than 35 amino acids. In another embodiment 
of the above' compound, 3 represents less than 30 amino 

30 acids. In another embodiment of the above compound, 3 



represents less than 25 amino acids. In another embodiment 
of the above compound, 3 represents less than 20 amino 
acids. In another embodiment of the above compound, |3 
represents less than 19 amino acids. In another embodiment 
of the above compound, 3 represents less than 18 amino 
acids. In another embodiment of the above compound, 3 
represents less than 17 amino acids. In another embodiment 
of the above compound, 3 represents less than 16 amino 
acids. In another embodiment of the above compound, 3 
represents less than 15 amino acids. In another embodiment 
of the above compound, 3 represents less than 14 amino 
acids. In another embodiment of the above compound, 3 
represents less than 13 amino acids. In another embodiment 
of the above compound, 3 represents less than 12 amino 
acids. In another embodiment of the above compound, 3 
represents less than 11 amino acids. 

In one errbodiment of the above compound, a represents less 
than 9 amino acids. In another embodiment of the above 
compound, oc represents less than 8 amino acids. In another 
embodiment of the above compound, a represents less th^in. 7 
amino acids. In another embodiment of the above compound, a 
represents less than 6 amino acids. In another embodiment 
of the above compound, a represents less than 5 amino 
acids. In another embodiment of the above compound, a 
represents less than 4 amino acids. In another embodiment 
of the above compound, a represents less than 3 amino 
acids. In another embodiment of the above compound, a 
represents less than 2 am.ino acids. In another embodiment 
of the above compound,- a represents less than 1 amino acid. 
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The CCR5 amino acid sequence is the following and is set 
forth in SEQ ID NO : 1 : 

1 MDYQVSSPIYDINYyTSEPCQKINVKQIAARLLPPLYSLV 
41 FIFGFVGNMLVILILINCKRLKSMTDIYLIiNLAISDLFFL 

5 81 LTVPFWAHYAAAQWDFGNTMCQLLTGLYFIGFFSGIFFII 
12 1 LLTIDRYLAWHAVFALKARTVTFGWTSVITWWAVFAS 
161 LPGI I FTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIVI 

2 01 LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI 
241 MIVYFLFWAPYNIVLLLNTEQEFFGLNNCSSSNRLDQAMQ 

10 2 81 VTETLGMTHCC INPII YAFVGEKFRNYLLVFFQKHI AKRF 

3 21 CKCCS I FQQEAPSRAS SVYTRSTGEQE I S VGL 3 52 

The OCRS nucleotide sequence is the following and is set 
forth in SEQ ID NO : 2 : 

15 1 GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 
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61 


TCCCTTCACT 


ACAAAACTTC 


ATTGCTTGGC 


CAAAAAGAGA 


GTTAATTCAA 


TGTAGACATC 


121 


TATGTAGGCA 


ATTAAAAACC 


TATTGATGTA 


TAAAACAGTT 


TGCATTCATG 


GAGGGCAACT 


181 


AAATACATTC 


TAGGACTTTA 


TAAAAGATCA 


CTTTTTATTT 


ATGCACAGGG 


TGGAACAAGA 


241 


TGGATTATCA 


AGTGTCAAGT 


CCAATCTATG 


ACATCAATTA 


TTATACATCG 


GAGCCCTGCC 


301 


AAAAAATCAA 


TGTGAAGCAA 


ATCGCAGCCC 


GCCTCCTGCC 


TCCGCTCTAC 


TCACTGGTGT 


361 ' 


TCATCTTTGG TTTTGTGGGC AACATGCTGG TCATCCTCAT CCTGATAAAC TGCAAAAGGC 


421 


TGAAGAGCAT 


GACTGACATC 


TACCTGCTCA 


ACCTGGCCAT 


CTCTGACCTG 


TTTTTCCTTC 


481 


TTACTGTCCC 


CTTCTGGGCT 


CACTATGCTG 


CCGCCCAGTG 


GGACTTTGGA 


AATACAATGT 


541 


GTCAACTCTT 


GACAGGGCTC 


TATTTTATAG 


GCTTCTTCTC 


TGGAATCTTC 


TTCATCATCC 


601 


TCCTGACAAT 


CGATAGGTAC 


CTGGCTGTCG 


TCCATGCTGT 


GTTTGCTTTA 


AAAGCCAGGA 


661 


CGGTCACCTT 


TGGGGTGGTG 


ACAAGTGTGA 


TCACTTGGGT 


GGTGGCTGTG 


TTTGCGTCTC 


721 


TCCCAGGAAT 


CATCTTTACC 


AGATCTCAAA 


AAGAAGGTCT 


TCATTACACC 


TGCAGCTCTC 


781 


ATTTTCCATA 


CAGTCAGTAT' 


CAATTCTGGA 


AGAATTTCCA 


GACATTAAAG 


ATAGTCATCT 


841 


TGGGGCTGGT 


CCTGCCGCTG 


CTTGTCATGG 


TCA-TCTGCTA 


CTCGGGAATC 


CTAAAAACTC 


901 


TGCTTCGGTG 


TCGAAATGAG 




ACAGGGCTGT 


GAGGCTTATC 


TTCACCATCA 


961 


TGATTGTTTA 


TTTTCTCTTC 


TGGGCTCCCT 


ACAACATTGT 


CCTTCTCCTG 


AACACCTTCC 


1021 


AGGAATTCTT 


TGGCCTGAAT 


AATTGCAGTA 


GCTCTAACAG 


GTTGGACCAA 


GCTATGCAGG 


1081 


TGACAGAGAC 


TCTTGGGATG 


ACGCACTGCT 


GCATCAACCC 


CATCATCTAT 


GCCTTTGTCG 


1141 


GGGAGAAGTT 


CAGAAACTAC 


CTCTTAGTCT 


TCTTCCAAAA 


GCACATTGCC 


AAACGCTTCT 



12 01 GCAAATGCTG TTCTATTTTC CAGCAAGAGG CTCCCGAGCG AGCAAGCTCA GTTTACACCC 

12 61 GATCCACTGG GGAGCAGGAA .ATATCTGTGG GCTTGTGACA CGGACTCAAG TGGGCTGGTG 

13 21 ACCCAGTCAG AGTTGTGCAC ATGGCTTAGT TTTCATACAC AGCCTGGGCT GGGGGT 

The YDINYYTSE sequence corresponds to amino acid residues 
10-18 of the CCR5 sequence set forth above. 

As used herein, "CCR5" is a chemokine receptor which binds 
members of the CC group of chemokines and whose amino acid 
sequence comprises that provided in Genbank Accession 
Number 1705896 and related polymorphic variants. The 
nucleotide sequence comprises that provided in Genbank 
Accession Number X91492. In one embodiment, the above 
compound may correspond to the extracellular portion of 
CCR5 . The first 31 amino acids of CCR5 correspond to the 
extracellular portion of CCR5 . Accordingly, the 
extracellular portion extends from the methionine at 
position number 1 to the arginine at position number 31 of 
SEQ ID N0:1. In another embodiment, the above compound may 
correspond to the amino- terminal portion of CCR5 . As used 
herein, "N-terminus" or amino -terminus means the sequence of 
amino acids spanning the initiating methionine and "the 
first transmembrane region. 

A-S used herein, "H2N" refers to the N-terminus or amino- 
terminus. As used herein, "COOK" refers to the C-terminus 
or carboxy- terminus . 

Various tyrosines of the compounds described herein may be 
sulfated. These include but aare not limited to the 
tyrosines at positions 3, 10 and 14 of amino acid sequence 
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set forth in SEQ ID NO : 1 . Accordingly, in one embodiment, 
the tyrosines at positions 10 and 14 are sulfated. In 
another embodiment, the tyrosines- at positions 3 and 14 are 
sulfated. In another embodiment, the tyrosines at positions 
5 3 and 10 are sulfated. In another embodiment, the tyx'osines 
at positions 3, 10 and 14 are sulfated. Other tyrosines in 
the sequence set forth in SEQ ID NO : 1 may also be sulfated. 

This invention provides a composition comprising one of the 
10 compounds described, herein and a detectable marker attached 
thereto. In one embodiment of the composition, the 
detectable marker is biotin. In one embodiment of the 
composition, the detectable marker is attached at the C- 
terminus of the compound. 

15 

The compounds of the subject invention may also be isolated 
or purified. In one embodiment the compound is labeled with 
a detectable marker. As used herein, chemical "labels" 
include radioactive isotopes, fluorescent groups and 
20 affinity moieties such as biotin that facilitate detection 
of the labeled peptide. Other chemical labels are wj^ll- 
known to those skilled in the art. Methods for attaching 
chemical labels to peptides are well-known to the skilled 
artisan . 

25 

As used herein, "peptide" and "polypeptide" ax-e used to 
denote two or more amino acids linked by a peptidic bond 
between the a-carboxyl group of one amino acid and the a- 
amino group of the next amino acid. Peptides may be 
30 ' produced by solid-phase synthetic methods that are well- 
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known to those skilled in the art. In addition to the above 
set of twenty amino acids that are used for protein 
synthesis in vivo, peptides may contain additional amino 
acids, including but not limited to hydroxyproline , 
sarcosine, and y- carboxyglutamate . The peptides may contain 
modifying groups including but not limited to sulfate and 
phosphate moieties. Peptides can be comprised of L- or D- 
amino acids, which are mirror-image forms with differing 
optical px'operties. Peptides containing D-amino acids have 
the advantage of being less susceptible to proteolysis in 
vivo . 

Peptides may by synthesized in monomeric linear form, 
cyclized form or as oligomers such as branched multiple, 
antigen - peptide (MAP) dendrimers (Tarn et al . ■ Biopolymers 
51:311, 1999). Nonlinear peptides may have increased 
binding affinity by virtue of their restricted 
conformations and/or oligomeric nature. Peptides may also 
be produced using recombinant methods as either isolated 
peptides or as a portion of a larger fusion protein that 
contains additional amino acid sequences. 

Peptides may be chemically conjugated to proteins by a 
variety of , well-known methods. Such peptide-prot.ein 
conjugates can be formulated with a suitable adjuvant and 
administered parenterally for the purposes of generating 
polyclonal and monoclonal antibodies to the peptides of 
interest. Alternatively, unconjugated peptides can be 
formulated with adjuvant and administered to laboratory 
animals for the purposes of generating antibodies. Methods 
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for generating and isolating such antibodies are well-known 
to those skilled in the art. 

This invention provides derivatives of the above compound. 
5 As used herein, a "derivative" peptide is one whose amino 
acid sequence is nonidentical to the reference peptide but 
which possesses functionally similar binding properties. 
Derivative peptides may also contain N-terminal, C-terminal 
and/or internal insertions, deletions, or substitutions of 

10 amino acids, with the proviso that such insertions, 
deletions and substitutions do not abrogate the binding 
properties of the peptide. Derivative peptides include 
peptides modified with chemical labels to facilitate 
detection. Derivative peptides include branched and 

15 cyclized peptides. 

As used herein, "sulf opeptides" are peptides that contain 
sulfate moieties attached to one or more amino acids, such 
as tyrosine. In "sulf o-tyrosines" , a sulfate group replaces 
20 the para-hydroxyl group located on tyrosine side-chain. 

As used herein, "phosphopeptides" are peptides that contain 
phosphate moieties attached to one or more amino acids, 
such a tyrosine. In "phospho- tyrosines" , a phosphate group 
25 replaces the para-hydroxyl group located on tyrosine side- 
chain. 

The peptides of the subject invention may be sulfated when 
synthesized or they may be subsequently sulfated. For 
30 examTDle, means of sulfating the peptides include chemical 
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sulfation or enzymatic sulfation. One skilled in the art 
would know how to employ these and other techniques to 
sulfate the compound. 

This invention provides a composition which comprises a 
carrier and an amount of one of the compounds described 
herein effective to inhibit binding of HIV-1 to a CCR5 
receptor on the surface of a CD4+ cell. 

The carriers include but are not limited to an aerosol, 
intravenous, oral or topical 'carrier. Accordingly. The 
invention provides the above composition adapted for 
aerosol, intravenous, oral or topical application. 

This invention provides the above compositions and a 
pharmaceutically acceptable carrier . Pharmaceut ically 

acceptable carriers are well known to those skilled in - the 
art. Such pharmaceutically acceptable carriers may include 
but are not limited to aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous 

solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic 
esters such as ethyl oleate . Aqueous carriers include 
water, alcoholic/aqueous solutions, emulsions .or 

suspensions, saline and buffered media. Parenteral 
vehicles include sodium chloride solution, Ringer's 
dextrose, dextrose and sodium chloride, lactated Ringer's 
or fixed oils. Intravenous vehicles include fluid and 
nutrient replenishers , electrolyte replenishers such as 
those based on Ringer's dextrose, and the like. 



-36- 



Preservatives and other additives may also be present, such 
as, for example, antimicrobials, antioxidants, chelating 
agents, inert gases and the like.- 

5 As used herein, "composition" means a mixture. The 
compositions include but are not limited to those suitable 
for oral, rectal, intravaginal , topical, nasal, opthalmic, 
or parenteral administration to a subject. As used herein, 
"parenteral" includes but is not limited to subcutaneous, 
10 intravenous, inti-amuscular , or intrasternal injections or 
infusion techniques. i 

As used herein, "administering" may be effected or performed 
using any of the methods known to one skilled in the art. 

15 The methods may comprise intravenous, intramuscular or 
subcutaneous means. As used herein, "effective dose" means 
an amount in sufficient quantities to either treat the 
subject or prevent the subject from becoming infected with 
HIV-1. A person of ordinary skill in the art can perform 

20 simple titration experiments to determine what amount is 
required to treat the subject. . 

This invention provides a method of inhibiting human 
immunodeficiency virus infection of a CD4+ cell which also 

25 carries a CCR5 receptor on its surface which comprises 
contacting the CD4-I- cell with an amount of one of the 
compounds described herein effective to inhibit binding of 
human immunodeficiency virus to the CCR5 receptor so as to 
thereby inhibit human immunodeficiency virus infection of 

30 the CD4+ cell. As used herein, "inhibits" means that the 



amount is reduced. In a preferred embodiment, inhibits 
means that the amount is reduced 100%. 

In one embodiment of this method, the CD4+ cell is present 
in a s\ibject and the contacting is effected by- 
administering the compound to the subject. 

This invention provides a method of preventing CD4+ cells 
of a subject from becoming infected with human 
immunodeficiency virus which comprises administering to the 
subject an amount of one of the compounds described herein 
effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4+ cells so 
as to thereby prevent the subject's CD4+ cells from 
becoming infected with human immunodeficiency virus. 

This invention provides a method of treating a subject 
whose CD4+ cells are infected with human immunodeficiency 
virus which comprises administering to the subject an 
amount of one of the compounds described herein effective 
to inhibit binding of human imm.unodef iciency virus to CCR5 
receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject. 

As used herein, human immunodeficiency virus includes but 
is not limited to HIV-1, which is the human 
immunodeficiency virus type-1. HIV-1 includes but is not 
limited to extracellular virus particles and the forms of 
HIV-1 found in HIV-1 infected cells. 
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As used herein, "KIV-1 infection" means the introduction of 
KIV-1 genetic information into a target cell, such as by 
fusion of the target cell membrane with HIV-1 or an HIV-1 
envelope glycoprotein* cell. The target cell may be a bodily 
5 cell of a subject. In the preferred embodiment, the target 
cell is a bodily cell from a human subject. 

As used herein, "inhibiting HIV-1 infection" means the 
reduction of the amount of HIV-l genetic information 
10 introduced into a target cell population as compared to the 
amount that would be introduced without the composition, 

I 

In the above methods, the compound may be administered by 
various routes including but not limited to aerosol, 

15 intravenous, oral or topical route. The administration may 
comprise intralesional , intraperitoneal, intramuscular or 
intravenous inj ection; infusion; liposome-mediated 

delivery; topical, intrathecal, gingival pocket, per 
rectum, intrabronchial , nasal, oral, ocular or otic 

20 delivery. In a further embodiment, the administration 
includes intrabronchial administration, anal, intrathecal 
administx-at ion or transdermal delivery. In another 
embodiment, the compound is administered hourly, daily, 
weekly, monthly or annually. In another embodiment, the 

25 effective amount of the compound comprises from about 
0.000001 mg/kg body weight to about 100 mg/kg body weight. 

The administr-at ion may be constant for a certain period of 
time or periodic and at specific intex"vals. The compound 
30 may be delivered hourly, daily, weekly, monthly, yearly 
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(e.g. in a time release form) or as a one time delivery. 
The delivery may be continuous delivery for a period of 
time, e.g. intravenous delivery. 

5 The carrier may be a diluent, an aerosol, a topical 
carrier, an ageuous solution, a nonaqueous solution or a 
solid carrier. 

The effective amount of the compound may comprise from 

10 about 0.000001 mg/kg body weight to about 100 mg/kg body 
weight. In one embodiment, 'the effective amount may 
comprise from about 0.001 mg/kg body weight to about >50 
mg/kg body ' weight . In another embodiment, the effective 
amount may range from about 0.01 mg/kg body weight to about 

15 10 mg/kg body weight. The actual effective amount will be 
based upon the size of the compound, the biodegradability 
of the compound, the bioactivity of the compound and the 
bioavailability of • the compound. If the compound does not 
degrade quickly, is bioavailable and highly active, a 

20 smaller amount will be requix-ed to be effective. The 
effective amount will be known to one of skill in the art ; 
it will also be dependent upon the form of the compound, 
the size of the compound and the bioactivity of the 
compound. One of skill in the art could routinely perform 

25 empirical activity tests for a compound to determine the 
bioactivity in bicassays and thus determine the effective 
amount . 

The compound of the present invention may be delivered 
30 locally via a capsule which allows sustained release of the 



agent or the peptide over a period of time. Controlled or 
sustained release compositions include formulation in 
lipophilic depots (e.g., fatty acids, waxes, oils). Also 
comprehended by the invention are- particulate compositions 
coated with polymers (e.g., poloxamers or poloxamines) and 
the agent coupled to antibodies directed against tissue- 
specific receptors, ligands or antigens or coupled to 
ligands of tissue-specific receptors. Other embodiments of 
the compositions of the invention incorporate particulate 
forms protective coatings, protease inhibitors or 
permeation enhancers for various routes of administration, 
including pax-enteral, pulmonary, nasal and oral. 

In one embodiment of the above methods, the subject is 
infected with HIV-1 prior to administering the compound to 
the subject. In one enibodiment of the above methods, the 
subject is not infected with HIV-1 prior to administering 
the compound to the subject. In one embodiment of the above 
methods, the subject is not infected with, but has been 
exposed to, human immunodeficiency virus. 

In one embodiment of the above methods, the effective 
amount of the compound comprises from about 1.0 ng/kg to 
about 100 mg/kg body weight of the subject. In another 
embodiment of the above methods, the effective amount of 
the compound comprises from about 100 ng/kg to about 5 0 
nig/kg body weight of the subject. In another embodiment of 
the above methods, the effective amount of the compound 
comprises from about 1 /^g/kg to about 10 mg/kg body weight 
of the subject. In another embodiment of the above methods. 
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the effective amount of the compound comprises from about 
100 //g/kg to about 1 mg/kg body weight of the subject. 

The dose of the composition of the invention will vary 
depending on the subject and upon the particular route of 
administration used. Dosages can range from 0.1 to 100,000 
Mg/kg. Based upon the composition, the dose can be 
delivered continuously, such as by continuous pump, or at 
periodic intervals. For example, on one or more separate 
occasions. Desired time intervals of multiple doses of a 
particular composition can be " determined without undue 
experimentation by one skilled in the art. 

As used herein, "effective dose" means an amount in 
sufficient quantities to either treat the subject or 
prevent the subject from becoming infected with HIV-1. A 
person of ordinary skill in the art can perform simple 
titration experiments to determine what amount is required 
to treat the subject. 

In one embodiment of the above method, the subject is a 
human being. As used herein, "subject" means any animal or 
artificially modified animal capable of becoming HIV- 
infected, Artificially modified animals include, but are 
not limited to, SCID mice with human immune systems. The 
subjects include but are not limited to mice, rats, dogs, 
guinea pigs, ferrets, rabbits, and primates. In the 
preferred embodiment, the subject is a human being. 



This invention provides a vaccine which comprises the 
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compound described herein. Vaccines comprising the 
sulf opeptides and a suitable adjuvant could be administered 
to a subject for the purposes of generating antibodies or 
other immune responses that are of therapeutic or 
•prophylactic value. For example, the vaccines could be 
administered for the purpose of generating in the subject 
antibodies that bind CCR5 and inhibit its ability to 
mediate HIV entry and infection, thereby protecting the 
subject from HIV infection or disease progression. The 
vaccines may also comprise a suitable adjuvant. The vaccine 
may also comprises a suitable carrier. ) 

The subject invention has various applications which 
includes HIV treatment such as treating a subject who has 
become afflicted with HIV, As used herein, "afflicted with 
HIV-1" means that the subject has at least one cell which 
has been infected by HIV-1. As used herein, "treating" 
means either slowing, stopping or x-eversing the progression 
of an HIV-1 disorder. In the preferred embodiment, 
"treating" means reversing the progression to the point of 
eliminating the disorder. As used herein, "treating" ^Iso 
means the reduction of the number of viral infections, 
reduction of the number of infectious viral particles, 
reduction of the number of virally infected cells, or the 
amelioration of symptoms associated with HIV-1, Another 
application of the subject invention is to prevent a 
subject- from contracting HIV. As used herein, "contracting 
KIV-1" means becoming infected with KIV-1, whose genetic 
information replicates in and/or incorporates into the host 
cells. Another application of the subject invention is to 
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treat a subject who has become infected with HIV-1. As used 
herein, "HIV-1 infection" means the introduction of HIV-1 
genetic information into a targe-t cell, such as by fusion 
of the target cell membrane with HIV-1 or . an HIV-1 envelope 
5 glycoprotein-" cell. The target cell may be a bodily cell of 
a subject. In the preferred embodiment, the target cell is 
a bodily cell from a human subject. Another application of 
the subject invention is to inhibit HIV-1 infection. As 
used herein, "inhibiting HIV-1 infection" means reducing 
10 the amount of HIV-1 genetic information introduced into a 
target cell population as compared to the amount that would 
be introduced without said composition. 

This invention provides a method of identifying an agent 
15 which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) immobilizing one of the compounds described' herein on 
a solid support;. 

(b) contacting the immobilized compound from step (a) with 
20 sufficient detectable CCR5 ligand to saturate all 

binding sites for the CCR5 ligand on the immobilized 
compound under conditions permitting binding of the 
CCR5 ligand to the immobilized compound so as to form 
a complex; 

25 (c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the agent; 
and 

(e) detecting whether any CCR5 ligand is displaced from 
the complex, whex^ein displacement of detectable CCR5 

30 ligand from the complex indicates that the agent binds 
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to the. compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 ligand 
to the compound so as to form a complex; - 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is bound to 
the compound in the complex; 

(d) contacting the complex, from step (a) with the agent so 
as to displace CCR5 ligand from the complex; 

(e) measuring the amount of CCR5 ligand which is bound to 
the compound in the presence of the agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in atep 
(c) , wherein a reduced amount measured in step (e) 
indicates that the agent binds to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) imimobilizing one of the compounds described herein on 
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a solid support; 

(b) contacting the immobilized compound from step (a) with 
the agent and detectable CCR5 ligand under conditions 
permitting binding of the' CCR5 . ligand to the 
immobilized compound so as to form a complex; 

(c) removing any unbound CCR5 ligand; 

(d) measuring the amount of detectable CCR5 ligand which 
is bound to the immobilized compound in the complex; 

(e) measuring the amount of detectable CCRS ligand which 
binds to the immobilized compound in the absence of 
the agent ; 

(f) comparing the amount of CCRS ligand which is bound to 
the immobilized compound in step (e) with the amount 
measured in step (d) , wherein a reduced amount 
measured in step (d) indicates that the agent binds to 
the compound so as to thereby identify the agent as 
one which inhibits binding of the CCRS ligand to the 
CCRS receptor. 

In one embodiment of the above method, the amount of the 
detectable CCRS ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCRS 
. ligand on the compound. 

This invention also provides a method of identifying an 
agent which inhibits binding of a .CCRS ligand to a CCRS 
receptor which comprises: 

(a) contacting one of the compounds described herein with 
the agent and detectable CCRS ligand under conditions 
permitting binding of the CCRS ligand to the compound 



so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable CCR5 ligand which 
is bound to the compound in tHe complex; 

(d) measuring the amount of detectable CCR5 ligand which 
binds to the compound in the absence of the agent; 

(e) comparing the amount of CCR5 . ligand which is bound to 
the compound in step (c) with the amount measured in 
step (d) , wherein a reduced amount measured in step 
(c) indicates that the agent binds to the compound so 
as to thereby identify the "agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

In one embodiment of the above method, the amount of the 

detectable CCR5 ligand in step (a) and step (d) is 

sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 



In one embodiment of the above method the solid support is 
a microtiter plate well. In another embodiment, the solid 
support is a bead. In a further embodiment, -the so.lid 
.support is a surface plasmon resonance sensor chip. The 
surface plasmon resonance sensor chip can have pre- 
imimobilized streptavidin . In one embodiment, the surface 
plasmon resonance sensor chip is a BIAcore™ chip. 

In one embodiment of the above methods, the detectable CCR5 
ligand is labeled with a detectable marker. In another 
embodiment of the above methods, the CCR5 ligand is 
detected by contacting it with another compound which is 
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both capable of detecting the CCR5 ligand and is 
detectable. The detectable markers include those described 
above . 

5 This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

a) immobilizing one of the compounds described 
herein on a solid support; 
10 b) contacting the immobilized compound from step a) 

with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact; 

c) contacting the immobilized compound from step a) 
15 with the known vehicle in the absence of the 

compound and measuring the binding signal 
generated by such contact; 

d) comparing the binding signal measured in step b) 
with the binding signal measured in step c) , 

20 wherein an increased amount measured in step b) 

indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

25 In one embodiment of the above method, the solid support is 
a surface plasmon resonance sensor chip. In another 
embodiment of the above method, the binding signal is 
measured by surface plasmon resonance. 



30 This invention provides a method of obtaining a composition 
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which comprises: 

(a) identifying a compound which inhibits binding of a 
CCR5 ligand to a CCR5 receptor according to one of the 
above methods ; and 

(b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 



The invention provides agents identified in the screen. 
Such agents may have utility in treating HIV-1 infection or 
10 other CCR5 -mediated diseases, which include rheumatoid 
arthritis, asthma, multiple sclerosis, psoriasis, 
atherosclerosis and other inflammatory diseases. 
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In one embodiment of the above methods, the CCR5 ligand is 
a comtplex comprising an HIV-1 envelope glycoprotein and a 
CD4-based protein. The HIV-1 envelope glycoproteins include 
but are not limited to gpl20, gpl40 or gpl60. The CD4-based 
proteins include but are not limited to soluble CD4 or CD4- 
IgG2 . 



As used herein, "CD4 " means the mature, native, membrane - 
bound CD4 protein comprising a cytoplasmic domain, a 
hydrophobic transmembrane domain, and an extracellular 
domain that binds to the HIV-1 gpl20 envelope glycoprotein. 
25 As used herein, "KIV-i envelope glycoprotein'* means the HIV- 
1 encoded protein which comprises the gpl20 surface 
protein, the gp41 transmembrane protein and oligomers and 
precursors thereof. As used herein, "CD4-based protein" 
means any px-otein comprising at least one sequence of amino 
acid residues cori-esponding to that poi'tion of CD4 which is 



30 



-49- 

reguired for CD4 to form a complex with the HIV-1 gpl20 
envelope glycoprotein. As used herein, "CD4-IgG2" means a 
heterotetrameric CD4 -human IgG2 fusion protein encoded by 
the expression vectors deposited under ATCC Accession 
5 Numbers 75193 and 75194. 

In one embodiment of the above methods, the CCR5 ligand is 
a chemokine . The chemokines include but are not limited to 
RAWTES, MlP-la or MIP-lp. As used herein, "RANTES", "MlP-la", 

10 and "MIP-lp" denote members of the chemokine superfamily of 
proteins that direct the activation and migration of 
leukocytes and other cells involved in the inflammation. 
RANTES, MI?-la and MIP-ip are known to bind CCR5 and induce 
signaling. Their peptide sequences have been described 

15 (Wells et al . J. Leukocyte Biology, 59:53-60, 1996). 

In one embodiment of the above methods, the CCR5 ligand is 
an antibody. In one embodiment, the antibody is PAS (ATCC 
Accession No. HB-12605) . In another embodiment, the 
20 antibody is PAIO (ATCC Accession No. 12607). In another 
embodiment, the antibody is PAll (ATCC Accession No. ,HB- 
12608) . In another embodiment, the antibody is PA12 (ATCC 
Accession No. HB-12609) . 

25 This invention provides a compound having the structure: 
A- (aYDINYYTSE(3A)„ 

wherein each T represents a threonine, each S represents a 
serine, each .E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
30 each I represents an isoleucine; and each N represents an 
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asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein X 
represents a carboxyl group or an ami dated carboxyl group; 
wherein all of a, Y, D, I , N, Y, Y , T, S , E and p are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

This invention also provides a compound having the 
structure: 

( eaYDINYYTSEP ) „ " A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; v;herein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
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order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the ^amino terminal direction; 
wherein (3 represents from 0 to 13 amino acids, with the 
■proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 6 
represents an amino group or an acetylated amino group; 
whei-ein all of a , Y , D , I , N, Y , Y', T , S , E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, wherein n is an integer from 1 to 8, 
A is a' polymex", and the solid line represents up to 8 
linkers which attach the structure in parentheses to A. 

This invention provides a compound having the structure: 

A- (aYDINYYTSEpX) n 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and ' have a sequence identical to the sequence set 
forth in SEQ ID NO : 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein (3 represents from 0 to 3 33 amino acids, with the 
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proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at 'position 19 and extending 
5 therefrom in the carboxy terminal direction; wherein X 
represents a carboxyl group or an amidated carboxyl group; 
wherein all of a, Y, D, I , N, Y, Y, T, S , E and 3 are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
10 integer from 1 to 8, A is a polymer, and the solid line 
repx-esents up to 8 linkers which attach the structure in 
parentheses to A. 

This invention also provides a compound having the 
15 structure: 

( eaYDINYYTSEp ) „ " A 

wherein each T represents a threonine, each S represents a 
serine, each E represents. a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 

20 each I r-epresents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 

25 forth in SEQ ID NO : 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein 3 represents from 0 to 333 amino acids, with the 
proviso that if there are more than 2 amino acids, they ai'e 
joined together by peptide bonds in consecutive order and 

30 have a sequence identical to the sequence set forth in SEQ 
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ID KfO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 0 
represents an amino group or an acetylated amino group; 
wherein all of a, Y, D, I , N, Y , T/S , E and (5 are joined 
5 together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

10 

The polymer of the above compounds includes but is not 
limited to the following: a linear lysine polymer; a 
branched lysine polymers; a linear arginine polymer; a 
branched arginine polymer; and polyethylene glycol (PEG) , a 
15 linear acetylated lysine polymer, a branched acetylated 
lysine polymer, a linear chloroacetylated lysine polymer 
and a branched chloroacetylated lysine polymer. 

The above compounds can be produced by various methods 
20 known to those skilled in the art, including but not 
■ limiited to the following. Methods for producing synthetic 
multimeric peptides such as multiple antigen peptides, 
synthetic polymeric constructs, and branched lysine 
oligopeptides are well known to those skilled in the art 
25 (Spetzler and Tam, Int. J. Pept . Prot . Res. 45:78, 1995; 
Yai et al . , J- Virol., 69:320, 1995; Okuda et al . , J. Mol . 
Recognit. 6:101, 1993). For example, radially branched 
peptides can be produced by performing standard solid-phase 
peptide synthesis methods using branched lysine skeletons 
30 on 4 - (oxy-methyl) -phsnylactamidomethyl or other suitable 
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solid resin. Peptide chains are elongated in parallel in a 
stepwise fashion using optimized t -butyl oxycarbonyl /benzyl 
chemistry as described (Sabatier et al . , Biochemistry 
32:2763, 1993). Peptides are liberated from the resin, 
•purified by reversed-phase chromatography over a C18 or 
other suitable column and characterized by analytical HPLC 
and mass spectroscopy. In another approach, monomeric 
peptides are synthesized, purified, and then covalently 
coupled to lysine copolymers using N- succinimidyl maleimido 
carboxylate chemistry. In another approach, the peptides 
can also be made in the form of affinity type multimers. 
For example, peptides miay be synthesized with an affinity 
tag such as biotin. These affinity tagged peptides can then 
be mixed with affinity ligands capable of binding 
multimerically , such as streptravidin . Other site - speci f ic 
ligation chemistries are known to the skilled artisan. 

This invention provides a compound comprising the 
structure: 

eccYDnnynnnEpA 

wherein each E represents a glutamic acid, each - D 
represents an aspartic acid, and each Y represents a 
tyrosine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein [3 represents from 0 to 13 amino acids, with the 
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proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence^ set forth in SEQ ID NO: 
1 beginning with the P at position 19 and extending 
5 therefrom in the carbcxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
group; wherein X represents a carboxyl group or an amidated 
carboxyl gi-oup; 

wherein n represents any amino acid, 
10 wherein all of a, Y, D, n, n, Y, n, n, n, E and (3 are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated. 

15 In one embodiment of this compound, the compound comprises 
amino acids in addition to those in the YDnilYnnilE peptide, 
and such amino acids correspond to those present in the 
CCR5 receptor sequence set forth in SEQ ID NO : 1 , yet an 
amino acid may be replaced with a homologous amino acid. 

20 The sequence YDIITIYnnriE corresponds to amino acid residues 
10-18 of the sequence set forth in SEQ ID N0:1.- '/For 
example, if the peptide has one additional amino acid on 
its N terminal end, then the sequence could be lYDnnYIinnE 
or alternatively, the I could be replaced with G, A, V or 

25 L. 

In one embodiment of the above compound, the compound is a 
peptide which comprises consecutive amino acids having the 
sequence YDnnYnnilE . 

30 
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In one embodiment of the above compound ^ the tyrosines at 
positions 1 and 5 of the sequence YDnilYnnnE are sulfated. 

As used herein, "homologous amino acids" are those which 
have chemiically similar side chains. For example, aliiohatic 
side chains (G, A, V, L and I) ; aromatic side chains (F, Y 
and W) / basic aide chains (K, R and H) ; acidic side chains 
(D and E) ; amide side chains (N and Q) ; aliphatic hydroxyl- 
containing side chains (S and T) ; sulfur-containing side 
chains (C and M) . Homology between amino acids may also be 
drawn on other* bases, such as size, polarity, hydrogen 
bonding potential, hydrophilicity and hydrophobicity . 
Proline differs from the above amino acids in that it 
contains a secondary rather than primary imino group. 
Accordingly, proline m^ay be considered an imino group. 
Substitution or proline with another amino acid (e.g. G, A 
or S) can increase the flexibility of a peptide. 
Conversely, substitution of another amino acid with a 
proline can stabilize a desired conformation. 

This invention provides a compound comprising • vthe 

structure ; 

eaYDINYYTSEpA 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutam.ic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if thex^e ax-e more than 2 amino acids, they are 



-57- 

joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 
5 wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the P at position 19 and extending 
10 therefrom in the carboxy terminal direction; 

wherein 0 represents an amino group or an acetylated amino 
group; wherein X represents a cax-boxyl group or an amidated 
carboxyl group; 

wherein all of a, Y , D, I , N, Y , Y , T , S , E and p are joined 
15 together by peptide bonds; 

further pi'ovided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, D at 
position 2f Y at position 5 and E at position 9 may be 
20 replaced with a homologous amino acid. 

In one embodiment of -the above compound, with respect to 
replacing homologous amino acids, any I amino acid residue 
may be replaced with a G,A,V or L amino acid residue. In 

25 one embodiment of the above compound, any N amino acid 
residue may be replaced with a Q amino acid residue. In one 
embodiment of the above compound, any Y amino acid residue 
may be replaced with a F or W amino acid residue. In one 
embodiment of the above compound, any T amino acid r-esidue 

30 may be replaced with a S amino acid residue. In one 



-58- 

embcdiment of the above compound, any S amino acid residue 
may be replaced with a T amino acid residue. In one 
embodiment of the above compound, any C may be replaced 
with M,S,T,A,G,N, or Q . 

5 

In one embodiment, a C amino acid x^esidue within the 3 
region of the compound miay be replaced with any other amino 
acid. 

10 This invention provides an agent which binds to an epitope 
of HIV-1 gpl20, which epitope comprises amino acid residues 
R298, N301, T303, 1322, D324, 1325, R326, 1420, K421, Q422, 
W427, thereby inhibiting binding of HIV-1 gpl20 to a CCR5 
chemokine receptor . 

15 

The above amino acid numbering is per KIV-1 strain KxB2 
(Genbank Accession No. AA350262). Amino acids D324, 1325 
and R326 are dex'ived from KIV-1 strain JR-FL (Genbank 
Accession No. AAB05604) . 

20 



25 



30 



The amino acid sequence (SEQ ID NO:17) for HIV-1 Hx32 gpl20 
is set forth below: 

1 MRVKEKYQHL WRWGWRWGTM LLGMLMICSA TEKLWVTVYY GVPVWKEATT TLFCASDAKA 
61 TOTEVKNVWA TKACVPTDPN PQEWLVKVT ENFl^KWKNDM VEQMEEEIIS LWDQSLKPCV 
121 KLTPLCVSLK CTDLKNBTOT NSSSGRMIMS KGEIKNCSFN ISTSIRGKVQ KEYAFFYKLD 
181 IIPIDNDTTS YKLTSCNTSV ITQACPICv/SF EPIPIKYCAP AGFAILKCNN KTFNGTGPCT 
241 NVSTVQCTHG IRPWSTQLL LNGSIASEEV VIRSVNFTDN AKTIIVQLNT SVEINCTRPN 
301 NNTRKRIRIQ RGPGRAFVTI GKIGNMRQAH CNISPJUCWNN TLKQIASKLR EQFGNNKTII 

3 61 FKQSSGGDPE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTW STEGSNNTEG SDTITLPCRI 
421 KQIINMWQKV GK.AMYAPPIS GQIRCSSNIT GLLLTRDGGN SNNESEIFRP GGGDMRDNWR 

4 81 SELYKYKWK lEPLGVAPTK AKRRWQREK R 



The amino acid sequence (SSQ ID NO: 16) for HIV-1 JR-FL 
gpl20 is set forth below: 

1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG VPVWKEATTT LFCASDAKAY 
61 DTSVKim^AT KACVPTDPNP QEWLENVTE KFNMWKNNMV EQMQEDIISL WDQSLKPCVK 
121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT SIRDEVQKEY ALFYKLDWP 
ISl IDNNNTSYRL ISCDTSVITQ ACPKISFEPI PIHYCAPAGF AILKCNCKTF NGKGPCKNVS 
241 TVQCTHGIRP WSTQLLLNG SLASESWIR SDNFTNNAKT IIVQLKESVE INCTRPNNKT 
3 01 RKSIHIGPGR AFYTTGEIIG DIRQAKCNIS RAKWNDTLKQ IVIICLREQFE NKTIVFNHSS 

3 61 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW Nl^^TEGSNNT EGNTITLPCR IKQIINMWQE 
421 VGICAMYAPPI RGQIRCSSNI TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKW 

4 81 KIEPLGVAPT KAKRRWQRE KR .... 

This invention provides the above agent, wherein the 
epitope is altered or masked by an alanine substitution of 
at least one of the amino acid residues R298, N301, T303, 
1322, D324, 1325, R326, 1420, K421, Q422 and W427. 

This invention provides an agent which binds to an epitope 
of HIV-1 gpl20, which epitope comprises amino acid residues 
R298, N301, T303, 1322, D324, 1325, R326, 1420, K421, Q422, 



W427, thereby inhibiting HIV-1 infection of a CD4+ CCR5+ 
cell. 

This invention provides the above agent, wherein the 
epitope is altered or masked by an alanine substitution of 
at least one of the amino acid residues R298, N3 01, T3 03, 
1322, D324, 1325, R326, 1420, K421, Q422 and W427. 

In one embodiment of any of the above agents, the agent is 
a peptide. In one embodiment of any of the above agents, 
the peptide comprises consecutive amino acids having the 
sequence YDINYYTSE. In one embodiment at least two 
tyrosines in the compound are sulfated. In one embodiment, 
the tyrosines at positions 1 and 5 of the sequence 
YDINYYTSE are sulfated. 

In one embodiment of any of the above agents, the agent is 
an antibody or portion of an antibody. In one embodiment of 
any of the above agents, the agent is a nonpeptidyl agent. 
In one embodiment of " any of the above agents, the agent is 
a peptidyl agent. 

This invention provides a method of inhibiting KIV-1 
infection of a CD4+CCR5+ cell which comprises contacting 
the cell with an amount of an agent of the subject 
invention effective to bind to HIV-1 gpl20, so as to ' 
thereby inhibit HIV-1 infection of the CD4+ CCR5+ cell. 



This invention provides 
following structures: 



a compound having one of the 
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A- (aYDINYYTSEpA) , ( 6aYDINYYTSE& ) -A, or A- (aYDINYYTSEP) "A 
wherein each" T represents a threonine, each S • represents a 
serine, each E represents a ..glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein A represents a carboxyl group or an amidated 
20 carboxyl group; 

wherein 6 represents an amino group or an acetylated amino 
group.; 

wherein all of a , Y, D, I , N, Y , Y , T , S , E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein A is a molecule that self -ol igomerizes , and the 
solid line represents a peptide linker or a peptide, 
disulfide, or other chemical bond. 
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As used herein "peptide linker" is a peptide comprising 
consecutive amino acids having a sequence which includes 
but is not limited to GAG, -SGGRGG and QSTRGGASGGG or 
repeating units thereof. One skilled in the art would know 
other flexible peptide linkers. 

In one embodiment of the above compound, the peptide that 
self-oligomerizes contains alpha-helical regions capable of 
forming coiled coils. 

The a-helical coiled coil (48) is probably the most 
widespread subunit oligomerizat ion motif found in proteins 
(48-52) . It is a type of protein structure consisting of 
two to five amphipathic a-helices that "coil" around each 
other in a left-handed supertwist (48-52) . The sequences of 
coiled are characterized by a heptad repeat of seven 
residues with a hydrophobic repeat of mostly apolar amino 
acids . 

In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence of.- at 
least a portion HIV-1 gp41 heptad repeat sequence 1. In one 
embodiment, the HIV-1 gp41 heptad repeat sequence 1 is 

RQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ (SEQ ID 
NO: 3) . 

In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence of at 
least a portion of an HIV-i gp4l heptad repeat sequence 2. 
In one embodiment, the HIV-i gp4i heptad repeat sequence 2 
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is WMEWDREINNYTSLIH5LIEESQNQQEKI\rEQELLE (SEQ ID NO: 4) . 

In one embodiment of the above ' compound, the peptide that 
self -oligomerizes is a peptide having a sequence 
coi-responding to at least a portion of the leucine zipper 
region of transcription factor GCN4 . In one embodiment; 
the sequence of the leucine zipper region of transcription 
factor GCN4 is HMKQLEDKVEELLSKITYKLENEVARLKKLVGER (SEQ ID 
NO : 6 ). . 

In one embodiment of the above compound, I the peptide that 
self -oligomerizes is a peptide having a sequence 
corresponding to at least a por-tion of the leucine zipper 
region of transcription factor GCN4 . In one embodiment, the 
sequence is derived from the leucine zipper region of 
transcription factor GCN4 . In one embodiment, the sequence 
forms trimeric coiled-coils . In one embodiment, the 
sequence is HMKQIEDKIEEILSKIYHIENEIARIKKLIGEV (SEQ ID 
NO : 7 ) . 



20 



In one embodiment of the above compound, the peptide *that 
self -oligomerizes is a peptide having a sequence 
corresponding to at least a portion of a leucine zipper 
region of a human protein. The human protein includes but 

25 is not limited to transcription activator c-fos, 
transcription activator c-jun, enzyme quiescent cell 
proline dipeptidase, macrophage scavenger receptor, 
salivary mucin (MUC7) , or human quiescent cell proline 
dipeptidase (QPP) . In one embodiment, the human protein is 

30 QPP and the leucine zipper region has the sequence 
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LLTVEQALADFAELLRALRRDL (SEQ ID NO-5) Tr, w ' 

^ ijj Nu.b) . In one embodiment, 

the transcription activator is c-^os anrf i 

■L ±b c ^os and the leucxne zipper 

region has the se 

LTDTLQAETDQLEDEKSALQTEIANLLKEKEKDfeFILAAR (SEQ ID lo-^^ \l 
5 one en^odiment, the transcription activator is c-jun " and 
the leucine .ippe. _.egion has the sec^ence 
KMRRIARLEEKVKTLKAQNSELASTANMLREQVAOLKQKY (SEQ ID NO: 9). 

in one embodiment of the above compound, the peptide that 
10 self-oligomeri.es is a peptide having a sequence 
corresponding to that of at least a portion of an antibody 
in one embodiment, the portion of the antibody comprises 
the heavy. Chain. In one embodiment, the portion of the 
antibody heavy chain comprises the heavy chain constant 
reg.on.. In one eoO^odiment, the portion of the antibody 
heavy chain comprises the hinge and Fc domains, m one 
embodiment, the portion of the antibody ' heavy chain 
comprises the Fc domain. In one embodiment, the portion of 
the antibody comprises the light chain. In one embodiment 
the portion comprises the light chain constant region In ' 
one embodiment, the portion of the antibody comprises vthe 
heavy and light chains. 

This invention provides a compound having one of the 

2^ following structures: 

A- (CYDINVYTSEBX) , (SaYDINYYTSEB) "A, or A- (cYDimTYTSEP ) - A 

Wherein each T represents a threonine, each s represents a 

serine, each E f eoi-esenh c: = 

-trp-v-sencs a glutamic acid, 
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each Y 

represents a tyrosine,- each D represents an aspartic acid 
each I represents an isoleucine; and each N represents an 
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asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein X represents a carboxyl group or an amidated 
carboxyl group ; 

wherein 0 represents an amino group or an acetylated amino 
group ; 

wherein all of a, Y, D, I , N, Y, Y, T , S , E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, • 
wherein A is toxin, and the solid line represents a peptide 
linker or a peptide, disulfide, or other chemical bond. 

In one embodiment of the above compound, the toxin is a 
radionuclide. In one embodiment, the radionuclide is an 
alpha-emitting isotope. The alpha-emitting isotope includes 
but is not limited to ^^^Ac , ^—At, ^^^B± , or ^^^B± . In one 
eiTibodim.ent , the radionuclide is a beta-emitting isotope. 
The beta-emitting isotope includes but is not limited to 
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' ^-"^ Iri one embodiment, the 

radionuclide may be emitting Auger and low energy electron. 
The radionuclide includes but is not limited to "^i, i^sj or 
"Br . 



m one embodiment of the above compound, the toxin is a 
chemical toxin. The chemical toxin may be a peptidyl 
chemical toxin. The peptidyl chemical toxin includes but is 
not limited to ricin. The chemical toxin may be a 
nonpeptidyl chemical toxin. The nonpeptidyl chemical toxin 
includes but is not limited to calicheamycin . 

This invention provides a compound having one of the 
following structures : 

A- (aYDINYYTSEpX) , (BaYDINYYTSEP) -A, or A- (aYDINYYTSEg ) -A, 
wherein each T represents a threonine, each S represents a 
.serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an . 
asparagine; 

wherein a represents from 0 to 9 amino acids, with" the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein p represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SSQ 
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ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein X represents a carboxyl group or an amidated 
carboxyl group; 

wherein 6 represents an amino group or an acetylated amino 
group.; 

wherein all of a, D, I , N, Y, T, S , E and p are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein A is molecule with anti-HIV activity, and the solid 
line represents a peptide linker or a peptide, disulfide, 
or other chemical bond. 

In one embodiment of the above compound, the molecule with 
anti-HIV activity is a CD4 - immunoglobulin fusion protein. 
In one embodiment, the CD4 -immunoglobulin fusion protein is 
CD4-IgG2, wherein the CD4-IgG2 comprises two heavy chains 
and two lights chains, wherein the heavy chains are encoded 
by an expression vector designated CD4 - IgG2KC-pRcCMV (ATCC 
Accession No. 75193) and the light chains are encoded ■ by -an 
expression vector designated CD4 -kLC-pRcCMV (ATCC Accession 
No. 75194) . 

In one errbodimient of the above compound, the molecule with 
anti-HIV activity is a compound which retai'ds gp41 from 
adopting a conf ormiat ion capable of mediating fusion of HIV- 
1 to a CD4+ cell by binding ncncovalent ly to an epitope on 
a gp41 fusion intermiediate . In one embodiment, 'the compound 
comprises a peptide selected from the group consisting of 
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T-20 (SEQ ID NO: 10), DP107 (SSQ ID NO: 11), N34 (SEQ ID 
NO: 12), C28 (SEQ ID NO: 13), N34(L6)C28 (SEQ ID NO: 14), 
and T1249 (SEQ ID NO:15) . 

As used herein, -7-20" and "DP178" are used interchangeably 
to denote a peptide having the following amino acid 
sequence: YTSLIKSLIEESQNQQSKNSQELLELDKWASLWNWF (SEQ ID 
NO:10) and as described [53,54]. 

DP107 has the following amino acid sequence: 
NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ (SEQ ID NO : 11 ) 

, N34 has the following amino acid sequence: 
SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQAR (SEQ ID NO: 12) 

C2 8 has the following amino acid sequence: 
WMEWDREINNYTSLIKSLIESSQNQQEK (SEQ ID NO: 13) 

N34(L6)C28 has the following amino acid sequence: 

SGIVQQQNNLLRAISAQQHLLQLTVWGIKQLQARSGGRGGWMEWDREINNYTSLIHSLI 
ESSQNQQEK (SEQ ID NO: 14) 

T1249 has the following amino acid sequence : 
WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF (SEQ ID NO: 15) 

This invention provides the above compound wherein the 
molecule with anti-KIV activity is a CCR5 chemokine 
receptor targeting agent. 



In one errtbodiment , the CCR5 chemokine receptor targeti 
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agent is an antibody or portion of an antibody. In one 
embodiment, the antibody includes but is not limited to PAS 
(ATCC Accession No. H3-12605'), ?A10 (ATCC Accession 
No. 12607), PAll (ATCC Accession No. HB-12608), PA12 (ATCC 
Accession No. KB-12609) , and PA14 (ATCC Accession No. HB- 
12610) , In one embodiment, the antibody is PA14 (ATCC 
Accession No. HB-12610) . 

The antibody may be a monoclonal antibody or polyclonal 
antibody. The monoclonal antibody m.ay be a human, humanized 
or chimeric antibody. This invention provides humanized 
forms of the above antibodies. 

As used herein, "humanized" describes antibodies wherein 
some, most • or all of the amino acids outside the CDR 
regions are replaced with corresponding amino acids derived 
from human im.munoglobulin molecules. In one embodiment of 
the humanized forms of the antibodies, some, most or all of 
the amino acids outside the CDR regions have been replaced 
with amino acids from human imimunoglobulin molecules but 
where some, most or all amino acids within one or more*-- CDR 
regions ar-e unchanged. Small additions, deletions, 
insertions, substitutions or modifications of amino acids 
are permissible as long as they would not abrogate the 
ability of the antibody to bind a given antigen. Suitable 
human immunoglobulin molecules would include IgGl, IgG2 , 
IgG3 , IgG4, IgA and IgM molecules. A "humanized" antibody 
would retain a similar antigenic specificity as the 
original antibody, i.e., in the present invention, the 
ability to bind CCR5 . 



One skilled in the art would know how to make the humanized 
antibodies of the subject invention. Various publications, 
several of which are hereby incorporated by reference into 
this application, also describe how to make humanized 
antibodies. For example, the methods described in United 
States Patent No. 4,816,557 (55) comprise the production of 
chimeric antibodies having a variable region of one 
antibody and a constant region of another antibody. 

united States Patent No. 5,225,539 (56) describes another 
approach for the production of a humanized antibody. This 
patent describes the use of recombinant DNA technology to 
produce a humanized antibody wherein the CDRs of a variable 
region of one immunoglobulin are replaced with the CDRs 
from ah immunoglobulin with a different specificity such 
that the humanized antibody would recognize the desired 
target but would not be recognized in a significant way by 
the human subject's immune system. Specifically, site 
directed mutagenesis is used to graft the CDRs onto the 
framework. 

Other approaches for humanizing an antibody are described 
in United States Patent Nos . 5,585,089 (57) and 5,693,761 
(58) and WO 90/07861 which describe methods for producing 
humanized immunoglobulins. These have one or more CDRs and 
possible additional amino acids from a donor immunoglobulin 
and a framework region from an accepting human 
immunoglobulin. These patents describe a method to increase 
the affinity of an antibody for the desired antigen. Some 
amino acids in the framework are chosen to be the same as 
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the amino acids at those positions in the donor rather than 
in the acceptoi". Specifically, these patents describe the 
preparation of a humanized antibody that binds to a 
receptor by combining the CDRs of a mouse monoclonal 
5 antibody with human immunoglobulin framework and constant 
regions. Human fi'amework regions can be chosen to maximize 
homology with the mouse sequence. A computer model can be 
used to identify amino acids in the framework region which 
are likely to interact with the CDRs or the specific 
0 antigen and then mouse amino acids can be used at these 
positions to create the humanized antibody. 

The above patents 5,585,089 and 5,693,761, and WO 90/07861 
(59) also propose four possible criteria which may used in 
designing the hum^anized antibodies. The first proposal was 
that for an acceptor, use a framework from a particular 
human immunoglobulin that is unusually homologous to the 
donor immunoglobulin to be humanized, or use a consensus 
framework from many human antibodies. The second proposal 
was that if an amiino acid in the framework of the human 
immunoglobulin is unusual and the donor amino acid at- that 
position is typical for humian sequences, then the donor 
amino acid rather than the acceptor may be selected. The 
third proposal was that in the positions immediately 
adjacent to the 3 CDRs in the humianized immunoglobulin 
chain, the donor amino acid rather than the acceptor amino 
acid may be selected. The fourth proposal was to use the 
donor amino acid reside at the fram^ework positions at which 
the amino acid is predicted to have a side chain atom 
within 3A of the CDRs in a three dimensional model of the 



antibody and is predicted to be capable of interacting with 
the CDRs. The above methods are merely illustrative of some 
of the methods that one skilled -in the art could employ to 
make humanized antibodies. 

This invention provides the above compound, wherein the 
portion of the antibody is a Fab fragment of the antibody. 
This invention provides the above compound, wherein the 
portion of the antibody comprises the variable domain of 
the antibody. This invention provides the above compound, 
wherein the por-tion of the antibody comprises a 
complementarity determining region or CDR portion of the 
antibody. The monoclonal antibody includes but is not 
limited to an IgG, IgM, IgD, IgA, or IgE monoclonal 
antibody . 

This invention provides the above compound, wherein the 
molecule with anti-HIV activity is a chemokine or chemokine 
derivative. The chemokine includes but is not limited to 
RANTES, MlP-la, MIP-13, SDF-1 or other chemokine which 
blocks HIV-1 infection. The chemokine derivative includes 
but -is not lim.ited to Met-RANTES, AOP-RA^TES, RANTES 9-68, 
or ]SrbJY-RANTES . 

The molecule may also be a non-chemokine agent capable of 
binding to chemokine receptors and inhibiting fusion of 
HIV-1 to CD4* cells. The non-chemokine agents include, but 
are not limited to, chemokine fragments and chemokine 
derivatives and analogues. in one embodiment, the agent 
does not include naturally occurring chemokines . The non- 
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chemokine agents include mult imeric forms of the chemokine 
fragments and chemokine derivatives and analogues or fusion 
molecules which contain chemokitae^ fragments, derivatives 
and analogues linked to other molecules. In one 

5 embodiment, the non-chemokine agents do not include 
bicyclams and their derivatives as described in U.S. Patent 
No. 5,021,409, issued June 4, 1991, the content of which is 
incorporated by reference into this application. Some 
bicyclam derivatives have been previously described with 
10 antiviral activities (60, 61) . 

In an embodiment of this invention, the non-chemokine agent 
is an oligopeptide. In another embodiment, the non- 

chemokine agent is a polypeptide. In still another 

15 embodimient , the non-chemokine agent is an antibody or a 
portion thereof. Antibodies against the chemokine receptor 
may easily be generated by routine experiments. It is also 
within the level of ordinary skill to synthesize fragments 
of the antibody capable of binding to the chemokine 

20 receptor. In a further embodiment, the non-chemokine agent 
is a nonpeptidyl agent such as TAK-779 (64) or AMD3100 
(65) . 

Non-chemokine agents which are purely peptidyl in 
25 composition can be either chemically synthesized by solid- 
phase methods (62) or produced using recombinant technology 
in either prokaryotic or eukaryotic systems. The synthetic 
and recombinant miethods are well known in the art . 

30 Non-chemokine agents which contain biotin or other 
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nonpeptidyl groups can be prepared by chemical modification 
of synthetic or recombinant .chemokines or non-chemokine 
agents. One chemical modification method involves 

periodate oxidation of the 2 -amino alcohol present on 
chemokines or non-chemokine agents possessing serine or 
threonine as their N- terminal amino acid (63) . The 
resulting aldehyde group can be used to link peptidyl or 
non-peptidyl groups to the oxidized chemokine or non- 
chemokine agent by reductive amination, hydrazine, or other 
chemistries well known to those skilled in the art. 

This invention provides a compound having one of the 
following structures: 

A- (aYDINYYTSE3X) , ( SaYDINYYTSEP ) "A, or A- (aYDINYYTSE3 ) "A 
wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
epresents a tyrosine; each D represents an aspartic acid, 
ch I represents an isoleucine; and each N represents an 
asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the am.ino terminal direction; 

wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
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therefrom in the carboxy terminal direction ; 

wherein A represents a carboxyl group or .an amidated 
carboxyl group; 

wherein G represents an amino group or an acetylated amino 
5 group; 

wherein all of a, D, I , N, Y, T, S , E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 
10 wherein A is peptidyl or nonpeptidyl agent, and the solid 
line represents a peptide linker, or a peptide, disulfide, 
or other chemical bond. 

In one embodiment, the A is a nonpeptidyl agent, and the 
15 nonpetidyl agent polyethylene glycol. 

This invention will be better understood from the 
Experimental Details that follow. However, one skilled in 
the art will readily appreciate that the specific methods 
20 and results discussed are merely illustrative of the 
invention as described more fully in the claims that follow 
thereafter . 
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EXPERXMENTAL DETAILS 

First Series of ExTperiments 
A . Materials 

Purified recombinant CD4-IgG2 protein was produced by 
Progenies Pharmaceuticals, Inc. from plasmids CD4-IgG2-HC- 
pRcCMV and CD4 -kLC-pRcCMV as described (Allaway et ai . AIDS 
Res. Hum. Retrovix'uses 11:533, 1995). Soluble CD4 is 
commercially available (NSN Life Science Products, Boston, 
MA) . Anti-CCR5 MAb 2D7 was purchased from Pharmingen (San- 
Diego, CA) . 



The plasmids designated PPI4 - tPA.-gpl2 0j^.pi^-V3 and PPI4-tPA- 
gpl2 0DHi23 were prepared as described (Hasel et al , US 
Patents 5,869,624 and 5,886,163). Monomeric gpl20 

glycoproteins were produced in CHO cells stably transfected 
with the PPI4-tPA-gpl20 plasmids and purified to 
homogeneity as described (Hasel et al . US Patents 5,669,624 
and 5,886,163 ; Trkola et al . Nature 384:184, 1996). The 
antibodies designated PA8 , PAIO, PA12 and PA14 were 
prepared by growing the corresponding hybridoma cell line 
in mouse ascites and isolating the antibody using protein A 
affinity chromiatography as described (Olson et al . J.Virol. 
73:4145, 1999). LI . 2 -CCR5* eel 1 s were cultured as described 
(Olson et al . J.Vir-ol. 73:4145, 1999). 



Peptides containing different segments of the CCR5 Nt were 
custom- synthesized by solid-phase f luorenylmethoxycarbonyl 
chemistry using phospho- and sul fo- tyrosine precursors as 
building blocks . where indicated (Figure 6) . Biotinylated 
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versions of peptides S-10/14 and P-10/14 incorporated a C- 
terminal GAG spacer preceding a biotinylated lysine. 
Following cleavage from the resin, peptides were purified 
by reverse -phase chromatography' on CI 8 columns (Vydac, 
Kesperia, CA) and analyzed by HPLC and mass spectroscopy. 
Figure 6 describes the different peptides that were used in 
this study. 

Binding of apl20 to CCR5 

A gpl20/CD4 complex formed from monomeric gpl20 (lOOnM) and 
biotinylated CD4-lgG2 (50nM) was added to 1x10^ L1.2-CCR5* 
cells in the presence of different concentrations of 
peptide (Olson et a J . J.Virol. 73:4145, 1999). CD4-IgG2 is 
tetrameric and therefore binds four molecules of gpl2 0, 
which increases binding of the complex to CCR5 (Allaway et 
al . AIDS Res. Hum. Retroviruses 11:533, 1995). The mean 
fluorescence intensity (m.f.i.) was measured by flow 
cytometry after addition of phycoerythrin (PE) -labeled 
streptavidin (Becton Dickinson, San Jose, CA) . Inhibition of 
gpl20/CCR5 binding was calculated: (m.f.i. with 
peptide) / (m. f . i . without peptide) xlOO%. ■ •/ 

It was first tested whether tyrosine-sulfated peptides 
spanning amino acids 2-18 of the CCR5 Nt could inhibit 
binding of the gpl2 Ojr.p:^/CD4 - IgG2 complex to CCR5* cells. The 
HIV-lji^.PL isolate exclusively uses CCR5 as a co-receptor 
(Dragic et si. Nature 381:667, 1996). Only peptides S- 
3/10/14 and £-10/14 inhibited complex binding to the cells 
in a dose-dependent manner (Fig. la) . Peptides S-10 and S- 
14 had no inhibitory activity, even at the highest 
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concentrations (Fig. la). Peptide TS-lO/14, spanning amino 
acids 10-14, did not inhibit gpl20^WCD4 -IgG2 binding to 
CCR5* cells, despite the presence of two sulfo- tyrosine 
residues (Fig. lb) . 



Tyrosine -phosphorylated peptides P-10/14 and P-3/10/14 did 
not inhibit gpl2 0j«.p^/CD4-IgG2 binding to CCR5* cells (Fig. 
lb) . As further specificity controls we synthesized 
peptides containing the first seventeen residues of the 
CCR5 Nt in random order with sulfo- tyrosines in positions 
10 and 14 (SS-10/14) or in positions 2 and 12 (SS-2/12). 
Neither one of these peptides reduced gpl2 0^.,,/CD4 - IgG2 
binding to CCR5* cells, even at the highest concentrations 
(Pig. lb) . 

Surface olasmon rer s onance Tneapni-rementi r (BIAro-rp) 
Streptavidin- coated sensor chips (BIAcore AB, Sweden) were 
conditioned with five injections of regeneration solution 
(IM NaCl, 50mM NaOH) and equilibrated with HBS-EP buffer 
(lOmM KEPES, IBOtnM NaCl, 3M EDTA, 0.005% polysorbate 20) as 
recommended by the manufacturer. Biotinylated peptides Vere 
then immobilized on the chip by injection of peptide 
(lOOnM) in HBS-EP buffer, followed by an injection of 
regeneration solution and equilibration with KBS-EP buffer. 
4 00 resonance units (RU) of peptide were bound to the 
sensor chip surface. Solutions of the following proteins 
(lOOnM) were passed over the sensor chip surface: gpl20, 
SCD4, gpl20/sCD4, PA8 , PAIO and 2D7 . Surface plasmon 
resonance was monitored and displayed in arbitrary 
resonance units (RU) as a function of time. Following 
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injection of each solution the chip was regenerated and 
equilibrated as described above, 

Biotinylated peptide was attached to the streptavidin- 
coated gold surface of a sensor chip and solutions 
containing different gpl2 0/sCD4 complexes were flowed over 
the immobilized peptide. Adsorption of the complex due to 
complex/peptide binding was detected by an increase in 
surface ■ plasmon resonance signal (RU) , which reports 
changes in the effective refraction index very near the 
gold surface of the sensor chip (Schuck Ann. Rev. Biophys 
Biomol Struct 26:541, 1997). For proteins of similar size, 
such as the different gpl2 0/sCD4 complexes, RU plateau 
values are directly proportional to the amount of protein 
bound to the peptide. 



Specific association of the gpl2 0^^.pj,/sCD4 complex with the 
sulfo-tyrosine-containing peptide bS-10/14 was accompanied 
by a significant increase in RU {Fig. 2a) . The signal 
plateau but not the shape of the sensograms varied with 
gpl2 0jR.pi,/sCD4 concentration indicating that ■ -.the 
peptide/complex interaction was dose-dependent (data not 
shown) . The sensorgram obtained with bP- 10/14 is similar 
to the one obtained in the absence of peptide, indicating a 
complete lack of association of the phosphorylated peptide 
with the protein complex (Fig. 2a). Neither gpl20^.p^ nor 
SCD4 alone produced a significant increase in RU, 
indicating that they did not associate with the immobilized 
peptides. (Fig. 2b, c). The gpl2 0 - AV3^WsCD4 complex was 
also unable to associate with the peptides (Fig. 2d) 
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To further ascertain the specificity of the peptide/complex 
association we performed BIAcore analyses using envelope 
glycoproteins from KIV-13hi23/ an -R5X4 isolate, and HIV-l^^j, 
an X4 isolate (5) . gpl 2 0^^123 /sCD4 associated specifically 
with the sulfated peptide, although the plateau RU values 
were lower than those observed with gpl2 0jR.FL/sCD4 (Fig. 
2e) . We did not detect any binding of gpl2 0dhi23/sCD4 to the 
phosphorylated peptide (Fig.2e) , nor did gpl20D„i23 alone 
associate with the peptides (Fig. 2f) . Finally, gpl20LAi with 
or without sCD4 was not able to associate with either one 
of the peptides (Fig. 2g,h). 

These methods could be readily modified to screen for 
agents that bind CCR5 or that block its interaction with 
antibodies, gpl2 0 or other ligands. For example, direct 
binding of the agents could be analyzed as described above, 
where the agent is substituted for the anti-CCR5 antibody 
or gpl20/sCD4 complex. In another embodiment, the agent 
could be m.ixed or pre-incubated with the anti-CCR5 antibody 
(or gpl2 0/sCD4 complex) prior to passing the mixture over 
biosensor chips as described above. . - 

Binding of MAbs to CCR5 

L1.2-CCR5 cells (1x10^) were incubated with anti-CCR5 MAb 
(50nM) ± peptide (lOOjuM) . MAb binding was detected using a 
PE-labeled goat anti-mouse antibody (Caltag Laboratories, 
Burlingam, CA) . The m.f.i value was measured by flow 
cytometry as described (Olson et al . J. Virol. 73:4145, 
1999) . MAb binding was calculated as above. 
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We determined whether the CCR5 Nt peptides could inhibit 
binding of a panel of anti-CCR5 MAbs to CCR5* cells. PA8 
binding was reduced significantly by all wild-type peptides 
containing amino acids 2-18, "regardless of tyrosine 
modification (Fig. 3) . BIAcore analysis confirmed that PA8 
similarly and specifically associated with both sulfated 
and phosphorylated peptides (Fig. 4) . Binding of PA12 to 
CCR5 was not inhibited by any of the peptides (Fig. 3) 
PAIO binding to CCR5 was inhibited only by S-3/10/14 (Fig. 

3) . PAIO was also observed to associate with bS-10/14 and 
to a lesser extent with b?-10/l4 in BIAcore analysis (Fig. 

4) , which may be more sensitive than the gpl2 0/CCR5 -binding 
assay. Binding of 2D7 to CCR5 was not inhibited by any of 
the peptides (Fig. 3) . No significant interaction was 
observed between any CCR5 Nt peptide and Mab 2D7 (Figs. 3 
and 4), whose epitope resides within the second 
exti-acellular loop on CCR5 . 

Single cycle HIV-1 entry assay 

Nlluc'env" particles pseudotyped with envelope glycoproteins 
fi-om MuIiV, HTLV-1 and HIV-1 strains JR-FL, KxBs , DH123 , Gun- 
1 were made as described (Dragic et al , J. Virol; 72:279, 
1998) , Target cells (Hela-CD4*CCR5* or U8 7-CD4-^CCR5-^) were 
incubated with virus - containing supernatant fractions 
(lOOng/ml p24) ± peptide (lOO^M) for 4 h. then washed and 
resuspended in culture media. After 4 8 hours the cells were 
lysed and luciferase activity (relative light units, 
r.l.u.) was mieasured using a standard kit (Promega, 
Madison, WI) as described (Dragic et ai . J. Virol. 72:279, 
1998) . Viral entry was calculated: (r.l.u. with 
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peptide) / (r . 1 .u without peptide) xlOO%. 

The ability of different CCR5 Nt peptides to inhibit HIV-l 
entry into CD4*CCR5*CXCR4* cells was tested using a 
lucif erase-based single round of entry assay (5) . Only 
peptides S-10/14 and S-3/10/14 inhibited the entry of the 
R5 isolate HIV-ljR.pi^ by approximately 50% in HeLa-CD4*CCR5* 
and U87MG-CD4*CCR5* (Fig. 5 and data not shown) . We were 
unable to inhibit the entry of the R5X4 isolates HIV-lr.«.,, 

DH12 J 

and HIV-Iq^^.-l, or of the X4 isolate HIV-1„_^2. The entry of 
MuLV and HTLV pseudotypes was also unaffected by the 
peptides (Fig. 5) . 



Screening assays 

1) HIV-l gpl2 0/CD4-IgG2 

Streptavidin-coated 96-well microtiter plates (NEN Life 
Science Products, Boston, MA) are blocked with 200 /zl/well 
of 5% bovine serum albumin (Sigma, St. Louis, MO) in PBS 
buffer and washed with assay buffer (0.5% Tween 20, 1 % 
fetal bovine serum, and 2% BSA in PBS buffer) . The plates 
are then incubated 1 hour at ambient temperature with 100 
^1/well of biotinylated CCR5 N- terminal sulfopeptide at a 
concentration of 500 /iM in assay buffer. Following a wash 
step, the plates are incubated for 1 hour at ambient 
temperature with an KIV-lj^.p-L gpl2 0/CD4 - IgG2 complex in the 
presence or absence of inhibitory agent. The plates are 
again washed and incubated for 30 minutes with a 
horseradish peroxidase-labeled goat antibody to human IgG 
(Kirkegaard Sc Perry Laboratories, Gaithersburg , MD) 
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followed by addition of the TMB (3, 3 ',5,5'- 
tetramethylbenzidine) chromogenic substrate (Pierce) . The 
reaction is stopped by addition ^^of 100 jul/well of 2N H2SO4 
pr-ior to colorimetric detection af' a wavelength of 450 nm. 
Wells without biotinylated peptide serve as negative 
controls. The percent inhibition of binding is calculated 

[1 - (OD„ith inhibitor - OD^cntrol well)/ (OD without inhibitor " OD^ontrol 

well)] ^ 100, where OD represents the average optical density 
observed for the indicated wells. 

2 ) Ant i - CCR5 ant ibodies 

Streptavidin-coated microtiter plates are blocked and 
incubated with CCR5 N-terminal peptide as described above. 
Following a wash step, the plates are incubated for one 
hour at- arribient temperature with the anti-CCR5 antibody 
PAIO in the presence or absence of inhibitory agent. The 
plates are again washed and incubated for 3 0 minutes with a 
horseradish peroxidase-labeled goat antibody to mouse IgG 
(Kirkegaard Sn Perry Laboratories, Gaithersburg, MD) 
followed by addition of TMB substrate for colorimetric 
detection as described above. The percent inhibition 
mediated by the inhibitory agent is calculated as described 
above . 



Discuss ion 

Tyrosine -modified peptides 
Nt that contains residues 
synthesized. (Dragic et al , 
et al , J. Virol. 72:3464, 



spanning the region of the CCR5 
important for viral entry were 
J. Virol. 72:279, 1998; Rabut 
1998; Farzan et al . J, Virol. 
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72:1160, 1998; Dorantz et al . J. Virol. 71:6305, 1997). 
Interactions between the Nt " peptides and gpl20/CD4 
complexes were characterized. Peptides containing sulfo- 
tyrosines in positions 10 and 14 efficiently inhibited 
5 binding of gpl2 0jr_pl/CD4 to CCR5 . Substitution of the 
sulfate groups for phosphates, which are also negatively 
charged at physiological pH, rendered the Nt peptides 
inactive. Inhibition of gpl20/CCR5 binding was dependent, 
therefore, on the presence of sulfate moieties and was not 

10 simply due to non-specific electrostatic interactions 
between the peptide and the gpl20/CD4. complex or the 
peptide and the cell surface. Inhibition of gpl20/CCR5 
binding was also dependent on the primary structure 
surrounding the sulf o-tyrosines since peptides with random 

15 sequences of CCR5 amino acids 2-18 * had no inhibitory 
activity. Additional Nt amino acids in the region 2-18 w^ere 
important for activity since a shortened peptide containing 
just amino acids 10-14 was unable to inhibit gpl20/CD4 
binding, despite the presence of two sulf o-tyrosines . It 

20 would be straightforward to define the minimum number of 
amino acids needed for activity by systematically 
synthesizing sulf opept ides intermediate in length between 
peptide 2-18 and peptide 10-14. Similarly, sulf opeptides 
that incorporate a greater portion of the CCR5 Nt could be 

25 easily synthesized and tested for activity using the 
methods described herein. 

Qualitative BIAcore analyses allowed the demonstration of a 
highly specific, CD4 - dependent interaction between a 
30 tyrosine-sulfated Nt peptide and gpl20jR.FL, No binding of 
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the protein complex to a tyrosine-phosphorylated peptide 
was observed. Only gpl20s derived from isolates that use 
CCR5 as a co-receptor associated ^ with the sulfated peptide. 
gpl2 0DHa23/CD4 binding was weaker than gpl2 0jr_pl/CD4 binding, 
suggesting that envelope glycoproteins from R5X4 isolates 
have a lower apparent affinity for CCR5 than envelope 
glycoproteins from R5 isolates. gpl20LAi/ derived from an 
isolate that only uses CXCR4 , did not bind to the sulfated 
peptide. A V3 loop-deleted gpl20jR.pL ciid not associate with 
the sulfated peptide, just as this protein was unable to 
bind to full length CCR5 on the -cell surface (Trkola et al , 
Nature 384:184, 1996). 

The binding of the Nt peptides to several anti-CCR5 MAbs, 
all of which recognize conformational epitopes in CCR5 and 
inhibit gpl2 0/CCR5 binding were also studied. PA12 and 2D7 
did not bind to any of the peptides. Binding of PAS to the 
peptides was independent of tyrosine-modif ication whereas 
PAIO associated more with the sulf o- tyrosine-containing 
peptide than with the phospho- tyrosine-containing peptide. 
It seems, therefore, that sulf o - tyrosines and phosp.ho- 
tyrosines are relatively interchangeable for the, purpose of 
MAb binding but that gpl2 0/CD4 binding has an absolute 
requirement for sulf o-tyrosines . Relatively subtle 
differences in size and geometry of sulfate and phosphate 
groups might be relevant for binding of the CCR5 Nt with 
gpl2 0, which must not only accept the negative charge, but 
also coordinate, probably by hydrogen bonds, the tyrosine 
sulfate oxygens. The kinetics of MAb binding to the CCR5 Nt 
peptides exhibited large apparent on rates and slow 
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apparent off rates, which also differed from our 
observations of gpl20/CD4 binding kinetics. 

None of the Nt peptides inhibited' MuLV, HTLV and HIV-1hxb2 
5 envelope-mediated viral entry, which is not mediated by 
CCR5. In contrast, peptides S-10/14 and S-3/10/14 
specifically inhibited the entry of the HIV-lj^.p^^ R5 strain 
in two different cell lines. The inhibition of HIV-1 entry 
by tyrosine-sulfated peptides was partial (-50%) but 
10 nonetheless striking given the difficulty of blocking this 
process with short, linear peptides {Jameson et al . Science 
240:1335, 1988; Chan and Kim Cell 93:6 81:1998; Doranz et 
ai . vJ. Bxp. Med. 186:1395, 1997; Heveker ■ et al . Current 
Biology 8:369, 1998; Eckert et al . Cell 99:1, 1999). 
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Second Series of Exiper iment s 

CD4 and CCR5 mediate fusion and entry of R5 HIV-1 strains. 
Sulf ©tyrosine and other negatively charged residues in the 
CCR5 amino -terminal domain (Nt) ' are crucial for gpl20 
5 binding and viral entry. It is shown' that a soluble 
gpl20/CD4 complex specifically binds to a peptide 
corresponding to CCR5 Nt residues 2-18, with sulf ©tyrosines 
in positions 10 and 14. This sulfopeptide also inhibits 
soluble gpl20/CD4 binding to cell surface CCR5 as well as 

10 infection by R5 virus. These observations prompted the 
fux^ther delineation of the determinants af the gpl20-CCR5 
Nt sulfopeptide interaction. It is shown that residues 10- 
18 constitute ' the minimal domain of the CCR5 Nt that is 
able to specifically interact with soluble gl20/CD4 

15 complexes. In addition to sulf ©tyrosines in positions 10 
and 14, negatively charged residues in positions 11 and 18 
participate in this interaction. Furthermore, the CCR5 Nt 
binds to a CD4- induced surface on gpl2 0 that is composed of 
conserved residues in the V3 loop stem and the C4 domain. 

20 Binding of gpl20 to cell surface CCR5 , however, is further 
influenced by variable residues in the crown of the,.. V3 
loop. This data suggest that gpl20 docking to CCR5 is an 
interdependent, multi-step process involving different 
regions of the envelope glycoprotein and the co-i^eceptor . 

25 

Entry of HIV-1 R5 isolates into target cells is mediated by 
the successive interaction of the envelope glycoprotein 
gpl20 with CD4 and the CCR5 co-receptor [3] . Gpl20-CD4 
complex fox-mation generates a large bonding energy that 
30 drives reordering of the gpl20 core structure [22, 31, 47] . 
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Changes in the orientation of the V1/V2 and V3 loops, as 
well as the bridging sheet (composed of the V1/V2 stem' and 
C4); cooperatively create and/or expose a co-receptor 
binding site on cpl20 [22,37,47], The predicted co-receptor 
5 binding surface on gpl20 has a hydrophobic core surrounded 
by a positively charged periphery and is composed of both 
conserved and variable residues located in the C4 domain 
and V3 loop, with lesser contributions from the V1V2 stem 
[22 , 36, 37] . 

10 

It has been demonstrated that specific amino acids within 
the CCR5 amino-terminal domain (Nt, amino acids 2-31) , 
including negatively charged and tyrosine residues, are 
essential for CCR5 -mediated fusion and entry of R5 and R5X4 
15 HIV-1 strains [5, 12, 13, 15, 35]. Farzan et ai . [16] 
demonstrated that the CCR5 Nt undergoes both O- 
glycosylation and tyrosine sulfation. It is presently not 
known whether O-glycosylation plays a role in co-receptor 
function, but this possibility is suggested by observations 
20 that serines in the Nt are important for viral entry. 
• Inhibition of cellular sulfation pathways, inducting 
tyrosine sulfation, greatly decreases gpl2 0 binding to CCR5 
as well as the entry of R5 and R5X4 HIV-1 strains into 
target cells ([16], E.G.C. unpublished data). Post- 
25 translational sulfation of the tyrosine residues in the 
CCR5 Nt, therefore, may critically modulate the 
susceptibility of target cells to HIV-1 infection in vivo. 

It was demonstrated that a CCR5 Nt -based peptide spanning 
30 residues 2-18 and containing sul f otyrosines in positions 10 
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and 14 specifically associates with soluble gpl20/CD4 
complexes containing envelope glycoproteins from R5 (JR-FL) 
and R5X4 (DH123) but not X4 (LAI) strains [11] . Peptides 
containing unmodified tyrosines*' or phosphotyrosines , 
however, did not bind soluble gpl20/CD4 complexes [11]. The 
tyrosine-sulfated CCRS Nt therefore specifically interacts 
only with gpl20 proteins from isolates that use this co- 
receptor to gain entry into target cells. Furthermore, only 
the CCR5 Nt -based sulfopeptide inhibits binding of soluble 
gpl2 0jR.FL/CD4 to intact, cell surf ace- expressed CCR5 and 
moderately blocks the entry of ' the R5 isolate JR-FL. The 
affinity of soluble gpl2 0/CD4 for the CCR5 Nt sulfopeptide, 
however, is approximately 10 -100 -fold lower than for the 
native, membrane- associated co-receptor [11, 42, 46], 
suggesting that other gpl20-CCR5 contacts are required to 
consolidate this interaction. This concept is further 
suppox'ted by studies of CCR5 chimera, as well as studies 
with inhibitors of CCR5 co-receptor function [12, 34, 38, 
32, 14] . 

A novel SLISA is reported to detect binding of 
sulf opeptides to soluble gpl20/CD4 complexes, as well as 
anti-CCR5 MAbs and chemoJ^ines . ELISA and surface plasmon 
resonance (SPR) were used to further delineate the 
determinants of the gpl20-CCR5 Nt interaction. In order to 
define the minimal domain of the CCR5 Nt capable of 
specifically binding to soluble gpl20/CD4 complexes, 
sulf opeptides corresponding to different regions of the Nt 
were analysed. To identify the gpl2 0 domains involved in 
sulfopeptide binding, inhibition of gpl2 0/CD4 complex 
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binding to CCR5 Nt sulf cpeptides by anti-gpl20 Mabs was 
studied. Residues in or near the epitopes of inhibitory 
MAbs were mutated to alanine, and the gpl2 0 point mutants 
were compared for their ability to bind to CCR5 Nt 
5 sulf ©peptides and cell-surface CCR5 . The data suggest that 
a mostly conserved surface of gpl20 binds to a nine-residue 
stretch of the CCR5 Nt , whereas more variable residues in 
the crown of the V3 loop may interact with a secondary 
binding site on CCR5 . 

10 

Materials and Methods 

Reagents: CD4-IgG2, soluble CD4 (sCD4) , recombinant soluble 
gpl20s from HlV-l^j (X4), KIV-1i,hi23 (^5X4), and HIV-lj^.^^ (R5) 

15 isolates, anti-gpl20 MAb ?A1 (directed against the V3 loop 
of JR-FLi) and anti-CCR5 MAbs PAS, PAIO, PAll, PA12 , PA14 
were produced by Progenies Pharmaceuticals, Inc. 
(Tarrytcwn, NY) as described [1, 32] . MAbs 133-290, 133- 
192, 135-9, A32, 17b, 19b, 48d, 9284, G3-42, Cll, G45-60 

20 and 2G12 were a generous gift [26] . The small -molecule CCR5 
antagonist TAK-779 was obtained as described [14] . 

Peptides corresponding to different segments of the CCR5 Nt 
v,;ere synthesized as described previously (Table 1) [11] . 
25 Sulfo- or phospho-tyrosines were incorporated in positions 
10 and 14, and all peptides carried a carbcxy- terminal Gly- 
Ala-Gly spacer preceding a biotinylated lysine. Residues 
were numbered according to their positions in the full 
length CCR5 protein. 
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Surf ace olasmon resonance : Binding of gpl20/CD4-IgG2 complex 
and MAbs to CCR5 Nt-based peptides was measured as 
previously described [11] . Briefly, streptavidin-coated 
sensor chips were divided into two surfaces, each with a 
5 separate flow chamber. The sensor chip was conditioned and 
equilibrated as recommended by the manufacturer. 
Biotinylated peptide (400 resonance units, RU) was bound to 
the surface of the second chamber whereas the first chamber 
of the chip was used as a negative control. Gpl2 0/CD4 - IgG2 

10 complex (50 nM) was passed over the chip surface in the 
presence or absence of MAbs (150 mM> Surface plasmon 
resonance was m.onitored and displayed in RU as a function 
of time using a Biacore X. After each measurement the chip 
was regenerated and equilibrated as recommended by the 

15 manufacturer. 

Generation of apl20 alanine mutants and their binding to 
CD4 - IctG -. : Mutant gpl20 proteins were generated using the 
QuickChange Kit from Stratagene (San-Diego, CA) . Gpl20jy».FL/ 

20 cloned into the pPPI4 expression vector [4] , served as the 
template for site directed mutagenesis. Nucleot^ide 
sequencing was performed to ascertain the presence of the 
appropriate mutation in the gpl20 coding sequence. 293T 
cells were calcium phosphate transfected with the different 

25 mutant gpl20 expression constructs. Supernatants containing 
soluble gpl2 0 proteins were harvested and cleared of debris 
by centrif ugat ion 24 hours post - transf ection . 

Quantification of gpl20 was performed by ELISA as 
previously described [40] ) . Briefly, 293T supernatants were 

30 boiled for 5 minutes and denatured gpl20 was captured on an 
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ELISA plate coated with D7324 (International Enzymes Inc. 
Fallbrook, CA) , a MAb that recognizes a 15 residue linear 
epitope in the carboxy-terminal - end of gpl20. Captured 
9pl20 was detected by a mixture of" anti-gpl20 MAbs B12 and 
5 B13 [40] , followed by incubation with a horseradish 
peroxidase-conjugated (HRP) anti-mouse IgG antibody 
(Amersham Pharmacia, Piscataway, N.J.). Optical density 
(O.D.) was measured at 450 nm using the ImmunoPure TMB 
Substrate kit (Pierce, Rockford, IL) . 

10 

CD4-IgG2 binding to non-denatured mutant gpl20 proteins also 
was measured. Plates coated with D7324 were used to capture 
native gpl2 0 from supernatant s of transiently transfected 
293T cells. CD4-IgG2 (50 nM) was added to the plates and its 
15 binding was detected using an KRP- conjugated goat anti- 
human IgG and TMB substrate as described above. 

CCR5 Nt -peotide ELISAs : Streptavidin- coat ed ELISA plates 
(Pierce, Rockford, IL) were blocked with D-P3S/ 5% BSA for 

20 2 hours at room temperature then washed three times with 
assay buffer (D-PBS/ 0.5% Tween 20/ 1% Fetal Bovine Serum/ 
2% BSA) . Plates were then contacted with sulfo- or phospho- 
peptides (1 /ig/ml) for 1 hour at room temperature and 
washed three times with assay buffer. Mixtures of CD4-IgG2 

25 (50 nM) and purified gpl20 or gpl2 0 -containing supernatants 
in a 1:4 molar ratio were added to the plates for 1 hour at 
room temperature. Plates were washed three times and (HRP) - 
conjugated goat anti-human IgG was used to detect the 
presence of bound CD4-IgG2- The plates were developed using 

30 the TMB substrate as described above. Gpl20/CD4-IgG2 binding 
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to the peptides was normalized for CD4-IgG2 binding to the 
mutant 9pl20 proteins. 

In a competition ELISA, peptides were captured onto the 
plates as described above. Inhibitor or assay buffer was 
added for 1 h prior to addition of gpl 2 Ojr.pl/CD4 - 1 gGa complex 

(1 nM) for an additional h at room temperature. The assay- 
was then completed as described above. Direct binding of 
anti-CCR5 murine MAbs to the peptides was examined as 
described above except that MAb was substituted for 
gpl20/CD4-IgG2 complex and a goat anti-mouse HRP-coupled 
antibody was used for detection. 

Binding of opl2 0 /CD4 - IgG ^ complexes to cell-surface CCR5 : 
L1.2-CCR5'' cells (10^) were incubated with gpl2 0 - containing 
supernatant (lOOnM) and biotinylated CD4-IgG2 (50nM) fox' 1 
hour at 37°C in assay buffer, as previously described [32]. 
Gpl20/CD4 -IgGs bound to the cells was revealed by FACS 
analysis of the mean fluorescence of intensity (m.f .i.) 
after addition of strept avidin-PE {Pharmingen, San-Diego, 
CA.). Binding was calculated using the formula: (m.f.i. 
gpl20 mutants) / (m. f . i . gpl20 wild type) x 100% and 
normalized for CD4-IgG2 binding to the mutant gpl2 0 
proteins . 

Results 

A-n ELISA to detect binding of soluble cr)120/CD4 complexes 
to CCR5 Nt-based peptides : Surface plasmon resonance (SPR) 
was previously used to show that gpl20/sCD4 complexes 
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specifically interact with a peptide spanning CCR5 residues 
2-18 and containing sulf otyrosines in positions 10 and 14 
(2-18, Table 1) . The on and off- rates of complex-pept ide 
binding were extremely rapid and could not be measured 
5 precisely by SPR. The Kd was estimated to be in the 10-"'-10"^ 
range. Replacing moncmeric sCD4 with tetiravalent CD4-IgG2, 
however, lead to a dramatic shift in both on and off rates, 
lowering the Kd into the 10"^- 10'^ range (Figure 7a) . This 
observation prompted us to develop an ELISA to directly 
10 detect complex-peptide binding. Streptavidin- coated ELISA 
plates were used to capture biot inylated, CCR5 Nt-based 
peptides, and then further incubated with soluble 
gpl20/CD4 -IgGj complexes. Complex binding was detected using 
an HRP- conjugated goat anti -human IgG antibody. 

15 

Sulfopeptide 2-18 bound gpl2 0jr.fl/CD4 - IgGj with an IC50 -InM, 
and gpl2 0r>Hi23/CD4-IgG2 with an IC50 -5nM (Figure 7b) . 
Sulfopeptide 2-18 did not measurably bind CD4-IgG2 alone or 
in complex with either gpl20L;,T or V3 loop-deleted gpl20jR.FL 

20 (Figui-e 7b and data not shown) . No binding was observed to 
an analogous CCR5 Nt phosphopept ide (2-18 (P), Figure 12). by 
any of the gpl2 0/CD4 - IgG2 complexes (Figure 7b) . Identical 
patterns of reactivity were observed for gpl20s in complex 
with CD4-Y2/ a divalent CD4 - imimunoglobulin fusion protein 

25 [data not shown, (1)] However, no binding was observed for 
gpl20 in comiplex with anti-gpl2 0 MAbs 2G12 and IgGlbl2, 
even though the latter' s epitope overlaps the CD4 binding 
site on gpl2 0 (data not shown) . Thus the ELISA reproduces 
the critical biological features of cell-surface CCR5/gpl20 

30 intex'actions , including a dependence upon CCR5 tyrosine 
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sulfation, CD4 , the V3 loop, and the coreceptor usage- 
patterns of the parent viruses. 

Using a competition ELISA, inhibition of gpl2 0jR.pL/CD4 - IgG2 
5 binding to the sulfopeptide 2-18 was enabled with the anti- 
CCR5 MAb PA8 . However, binding of soluble gpl2 0/CD4-IgG2 
complexes to the sulfopeptide was not inhibited by TAK-779, 
nor the CC- chemokines MlP-la, MIP-13 ^nd RANTES even when 
used at supraphysiologic concentrations. 

10 

Binding of CCR5 Nt peptides to anti-CCR5 MAbs and soluble 
cpl20 /CD4 : ELdSA was used to test the binding of a panel of 
anti-CCR5 MAbs to peptides 2-18 and 2-18 (P). We had 
previously demonstrated that MAbs PAS, PAll and PA12 bind 

15 epitopes in the Nt, PAIO binds an epitope that spans the Nt 
and ECL2 and PA14 binds an epitope exclusively in ECL2 
[32] . Here we show that FAS avidly binds peptides 2-18 and 
2-18 (P) (Figure 2). PAIO binds avidly to 2-18 and 
moderately to 2-18 (P). PA.ll binds moderately to 2-18 and 

20 weakly to 2-18 (P). PA12's binding is weak for 2-18 and 
undetectable for 2-18 (P) . Finally, PA14 does not recognize 
either the sulfopeptide or the phosphopept ide (Figure 8) . 
Furthermore, PAS binds similarly to all of the CCR5 Nt- 
based sulf ©peptides in Figure 12 (data not shown) . 

25 

In order to more precisely delineate the minimal CCR5 Nt 
domain that specifically binds to soluble gpl20/CD4 
complexes, a panel of sulf opept ides spanning different 
stretches of the CCR5 Nt were synthesized (Figure 12) . All 
30 of the peptides carx-ied sulf otyrosines in positions 10 and 
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14 since we previously showed that these are required for 
complex-peptide binding. Binding of gpl2 0jH.FL/CD4 - IgGj to the 
different sulf opept ides was tested by ELISA. Although the 
strongest binding was observed usihg longest sulf opept ide , 
5 -2-18, significant binding for peptide 10-18, which 
demonstrated -3 -fold lower avidity was also observed 
(Figure 9) . Peptides 8-15, 6-16 and 10-15 bound the soluble 
complex at least ten-fold less avidly than 2-18. (It was 
previously shown that a sulfopeptide consisting of residues 
10 10-14 did not bind soluble gpl20/CD4 complexes.) 
Furthermore, the gpl2 Oj^.p^/CD4 - IgG2 complex only weakly 
bound to peptide 10-18 carrying two alanine mutations in 
positions 11 and 18. Previous mutagenesis studies have 
shown that residues Asp- 11 and Glu-lS are important for 
fusion, entry and gpl20-CCR5 binding [12, 13] . Finally, it 
should be noted that the same binding patterns to the 
different sul f opept ides were observed with soluble 
complexes containing gpl20pHi23 (data not shown) . 
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Inhibition o f crDl20/CD4 bjndino to CCR5 Nt sul f open t i f^^c, v>y 

anti-apl2 0 MAbs : In order to determine which domains of 

gpl2 0 were involved in binding to CCR5 Nt -based 
sulf opeptides, the ability of a panel of well-characterized 
anti-gpl20 MAbs [25] to inhibit binding of either the 
gpl2 0a^_P^/CD4-IgG2 or the gpl2 Cc.,23/CD4 - IgG2 complex to the 
2-18 sulfopeptide was tested. Only MAbs directed against 
CD4- induced (CD4i) epitopes and the V3 loop were capable of 
inhibiting binding of the gpl2 0/CD4 - IgG2 complex to the 
CCR5 sulfopeptide. Inhibition of gpl2 0j^.pL and gpl2 0^Hi23 
binding by MAbs 17b and 48d was >90%. The anti-V3 loop MAb 
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19b [28] , which recognizes an epitope in the crown of the 

V3 loop (sequence -I G--FY-T) and is reactive with R5 

strains, inhibited gpl2 0jR.FL binding >90% and gpl2 0DKi23 
binding by approximately 80%. Anti-V3 loop MAb PAl , which 
5 was raised against gpl20jH_pL (W.C.O,, unpublished results) 
efficiently inhibited binding of gpl2 0j3i.pi^ but not gpl2GDHi23- 
Finally, anti-V3 loop R^^b 9284 [21] , which recognizes an 
epitope spanning residue- 307 to 330 in the V3 loop of X4 
strains, was unable to inhibit binding of either gpl20 

10 protein to the sulf opeptide . MAbs directed against other 
epitopes in other constant and variable ^regions of gpl2 0 
also had no effect on binding of the soluble complex to the 
peptides. Similar results were obtained when the anti-gpl20 
MAbs were used to inhibit soluble complex binding to cell 

15 surface CCR5 (data not shown) . 

Binding of mutant soluble gr)12 0/CD4 com^olexes to CCR5 Nt 

sulf opeptides : Numerous studies have shown that residues in 
the V3 loop determine co-receptor usage and binding [6-10, 

20 18, 20-21, 23-24, 27, 29, 33, 41-44, 46]. The crystal 
structure of a gpl20 lacking the V1/V2 and V3 loops..- in 
complex with sCD4 and the 17b MAb further implicated a 
conserved, CD4i surface on gpl20, adjacent to the V3 loop, 
in co-receptor binding [36, 37] . Single alanine mutants of 

25 all of the residues near or within regions previously shown 
to be important for co-receptor usage were generated. These 
gpl2 0 mutants were tested for their ability to bind to the 
CCR5-based sulfopeptide 2-18 as well as to cell surface 
CCR5 . Binding was normalized for gpl20 mutant binding to 

30 CD4-IgG2. Wild- type levels of binding were observed for all 



-101- 

mutants except W4 2 7A, R4 4 0A and R4 6 9A, which bound CD4-IgG2 
with 5-10-fold lower but nonetheless measurable avidity. 

Residues in both strands of the V3 loop stem were found to 
be involved in gpl20 binding to the 2-18 sulf opept ide : 
Alanine mutants of residues R298, N301, T303, 1322, D324, 
1325 and R326 were found to decrease complex binding to the 
peptide by >10-fold. Residues in the crown of the V3 loop, 
including the GPGR motif, however, had no effect on gpl2 0 
binding to the sulf opeptide . C4 residues in or adjacent to 
the two C- terminal 3- strands of the bridging sheet were 
also shown to participate in binding to the sulf opept ide ; 
Alanine substitutions of R419, 1420, K421, Q422 and R444 
decreased complex binding to the sulfopeptide by 5-10-fold. 
None of the alanine substitutions that we introduced in the 
other regions of gpl20 significantly affected complex- 
peptide-interactions . 

It was furthermore demonstrated that additional gpl20 
residues are involved in complex binding to cell surface 
expx-essed CCR5 . Alanine substitutions of residues .S30£; 
G310, P311, R313, F315, Y316 in the crown of the V3 loop 
decreased complex binding to CCR5 by 5-10-fold. 
Furthermoi-e , alanine substitutions of several residues in 
CI, C2 and C3 also had a moderate effect on complex binding 
to CCR5 . Finally, it is noted that alanine substitutions of 
R44 0 and R4 69 increased complex binding to both 2-18 and 
CCR5 ; whereas substitutions of E320 and W427 increased 
complex binding to CCR5 only. 



Tyrosine-sulfated CCR5 Nt peptides were studied for binding 
to soluble gpl20/CD4 cornpiexes as well as anti-CCR5 MAbs, 
CC-chemokines and TAK-779 using a* novel solid-phase ELISA. 
Inhibition of peptide-complex interactions by anti-gpl20 
MAbs was explored by surface plasmon resonance. These Mabs 
were also tested for their ability to inhibit complex 
binding to cell surface CCR5 . In addition, a panel of 
gpl2 0 point mutants were generated and then their 
reactivity was compared with CCR5 Nt peptides and cell 
surface OCRS. The principal conclusions are that (1) 
residues 10 to 18 of the CCR5 Nt may define the minimum 
recognition site for gpl20, (2) gpl20 binding to the CCR5 
Nt depends on highly conserved residues located in the C4 
domain and the stem of the V3 loop, and (3) gpl20 binding 
to cell surface CCR5 depends on a broader region that 
includes residues in the crown of the V3 loop, CI, C2 and 
C3 . The findings suggest that distinct domains of gpl20 
and CCR5 bind in a miulti-step fashion and raise questions 
about the determinants of specificity of the co-receptor- 
gpl20 interaction. . .,• . 

A.n ELISA was developed to detect complex-pept ide binding 
based on the observation that the tetravalent gpl20/CD4 - IgG- 
complex binds to CCR5 Nt sulf ©peptides ten-to a hundred - 
fold more avidly than the monovalent gpl2 0/sCD4 complex. 
Complex- sulfopept ide binding was only observed with gpl20 
proteins derived from R5 and R5X4 , but not X4 HIV-1 
strains. V3 loop deleted gpl20j^.pL failed to bind to the 
sulf opeptides . Phosphopept ides did not bind to any of the 
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soluble complexes. Thus, the BLISA reproduces the critical 
biological features of cell-surface CCR5-gpl20 

interactions, including a' dependence upon CD4 , CCR5 
tyrosine sulfation, the V3 loop and the co-receptor usage 
5 patterns of the par-ental viruses . 

CCR5 Nt phosphopeptides and sulf opeptides were 
differentially recognized by anti-CCR5 MAbs in ELISA. PAS 
possessed equal avidity for sulfated and phophorylated 

10 peptides, implying that its epitope does not include 
tyrosine side chains. PAIO and PAll preferentially 

recognized the sulf cpept ide , albeit with varying 
efficiencies, suggesting that sulf otyrosines participate 
either directly in peptide-MAb interactions or indirectly 

15 by influencing epitope conf crmtat ions . PA12 only interacted 
with the sulfopeptide and PA14 did not bind either Nt 
peptide. Ir was previously shown that both of these MAbs 
recognize discontinuous epitopes comprising residues in the 
Nt and ECL2 of CCR5 . The observations now imply that ECL2 

20 residues are marginal for PA12 binding and essential for 
PA14 binding to CCR5 . Finally, binding of soluble 

gpl20/CD4 complexes to CCR5 Nt peptides couls be completed 
with an anti-CCR5 Mab but not with either CC-Chemokines or 
TAK-77S, whose binding sites have been mapped to other 

25 regions of CCR5 (14, 39). Both CC-chemokines and TAK-779, 
however, are able to compete with gpl2 0/CD4 binding to cell 
surface CCR5 , perhaps through stex'ic or comf ormiational 
mechanisms. (14, 42, 46). It is noted that Farzan et al . 
reported that a CCR5 Nt sulfopeptide spanning residues 1-22 

30 partially blocks MlP-la binding to cell-surface CCR5 and we 



attribute the discrepancy to differences in peptides and 
assays (17) . 

In order to more precisely delineate the gpl20 binding site 
in the CCR5 Nt , an ELISA was used to test binding of 
soluble complexes to several CCR5 Nt sulf opept ides . 10-18 
was the smallest sulfopeptide that avidly bound soluble 
gpl20/CD4 complexes and may define the minimum docking site 
for gpl20 on CCR5 . In addition to the two sulf otyrosines 
in positions 10 and 14, negatively charged amino acids Dll 
and E18 were found to be critical for complex-peptide 
binding. It was concluded that a cluster of negative 
charges in the CCR5 Nt appears to represent the principal 
recognition motif for gpl20, although residues 2 to 9 
further contr-ibute to binding. Similar patterns of peptide 
reactivity were observed for recombinant gpl20s derived 
from HIV-ljR_PL (R5) and HIV-1dhi23 (R5X4) , suggesting that the 
CCR.5 Nt sulf opeptides recognize conserved structures in the 
envelope glycoprotein. Gp12Cdhi23/ however, bound about 
five-fold less that gpl20jR.pL to the sulf opeptides , which 
probably accounts for its less efficient usage of . GCR5 
(13). 

Anti-gpl20 KAbs were tested fox* their ability to inhibit 
gpl20/CD4 binding to sulf opept ides or to cell surface CCR5 . 
A number of anti-gpl2 0 MA.bs directed against conserved and 
variable regions of the envelope glycoprotein were not 
inhibitory. Only Mabs 48d and 17b, directed against CD4i 
epitopes, and ISb and PAl , directed against the V3 loop, 
efficiently inhibited gpl20 binding to the 2-18 



sulfopeptide and to cell surface CCR5 . The CD4i epitope 
was previously shown to participate in co-receptor binding 
and residues in the V3 loop primarily determine co-receptor 
specificity (36, 37) . The results now suggest that these 
regions of gpl20 determine its association with the CCR5 
Nt . It is noted that inhibition of peptide - compl ex binding 
by 19b, which recognizes an epitope in the V3 crown, is 
inconsistant with the finding by gpl20 mutagenesis 
experiments that residues in the V3 loop crown do not 
participate in complex-peptide binding. This leads to a 
conclusion that the inhibitory effect of 19b may be steric 
hindrance . 

In order to determine more precisely the. regions of gol20 
that modulate the gpl20-CCR5 itneraction, the binding of a 
panel of gpl20 point mutants to the CCR5 Nt sulfopeptide 
and to cell surface CCR5 was tested. The mutants were 
created by the introduction of single alanine substitutions 
near or within regions previously shown to be important for 
the integrity of the CD4i epitope and/or CCR5 binding (36, 
37) . Highly conserved residues in C4 and the V3 loop stem., 
including for arginines and a lysine, were found .to affect 
binding of gpl20 to the CCR5 Nt sulfopeptide (Figure 13) . 
These residues are located in two random coil segments of 
C4 that straddle the V3 loop stem and may constitute a 
positively charged CCR5 Nt binding domain (22) . 
Additional, conserved residues in the crown of the V3 loop, 
CI, C2 and C3 contribute to gpl20 binding to cell surface 
CCR5 (Figure 13) . It remains to be determined whether 
these residues interact with other extracellular domiains of 
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CCR5 or whether they influence the conformation of C4 and 
the V3 loop stem in a way that is only relevant in the 
context of gpl20/CD4 binding to cell surface CCR5 . It is 
unlikely that these residues also "interact with the Nt in 
the context of cell surface CCR5 because they are 
relatively distal from the C4 and V3 residues that were 
implicated in sulfopeptide binding (22) . 

To date, several lines of evidence suggest that gpl20 binds 
to more that one region of the CCR5 co-receptor: (1) the 
affinity of gpl20s/CD4 for the CCR5 Nt sulfopeptide is 
approximately 10-100-fold lower that for the native, 
membrane-associated co-receptor (11, 42, 46), (2) co- 
receptor chimera studies implicate the extracellular loops 
in viral fusion and entry (2, 12, 34, 38) and (3) 
inhibitors of CCR5 co-receptor function such as Mabs 2D7 
and PA14, as well as TAK-779 do not bind to the CCR5 Nt yet 
block gpl20/CD4 binding to CCR5 (14, 32) . The present 
findings could be interpreted to support a distributed 
model for gpl20-CCR5 interactions that mirrors the two-site 
paradigm proposed for the interaction of certain chemokines 
with their receptors (30, 45). In this model, binding is 
initially driven by electrostatic interactions between 
negatively charged residues in the receptor Nt and basic 
surfaces -on the chemokine ligand. This binding serves to- 
orient the ligand and promote its interactions with other 
portions of the chemokine receptor. The V3 loop crown may 
form initial electrostatic interactions with the 
extracellular loops of CCR5 , which would allow the CCR5 Nt 
to bind to a conserved region of gpl20 comprising residues 



in C4 and the V3 loop stem. Alternatively, the CCR5 Nt 
could first bind the C4/V3 stem domain, which would them 
promote an interaction of the loop with some other 

region of CCR5 . All of these interactions involve 

additional gpl20 residues that we have yet to identify. 
The role of the putative second interaction is unclear but 
it may further stabilize the gpl20-CCR5 interaction, 
optimally orienting the fusion apparatus, or triggering 
gp41 conformational changes that are required for fusion. 

The findings present us with a seeming paradox wherein nine 
residues of the CCR5 Nt confer specificity on the CCR5- 
gpl20 interaction by binding to gpl20 residues that are 
highly conserved among clade B isolates, regardless of 
their co-receptor usage. However, although the C4 and V3 
stem residues themselves are conserved, their precise 
placement may differ for R5 and X4 viruses. Clearly, 
relatively minor differences in the orientation, exposure 
or relative positioning of these widely separated residues 
could abrogate binding to a short peptide but not a MAb 
(e.g., 17b) possessing a larger, more distributed binding 
site (37). In addition, more variable amino acids (e.g., 
324) within or outside the C4/V3 loop stem may contribute 
to the specificity of the gpl20-Nt interaction, and we 
showed that residues N279, R313, ?369 and R444 participate 
in gpl20/CD4 binding to cell surface CCR5 but not to CCR5 
Nt sulfopeptides . Future studies employing additional 
gpl20 mutants together with CCR5 m.utants and CXCR4-based 
sulfopeptides will shed light on the specificity 
determinants of the gpl2 0 - co- receptor interaction. 
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CD4 and CCR5 mediate fusion and entey of R5 KIV-1 strains. 
Sulfotyrcsines and negatively charged residues in the CCR5 
Nt are crucial for binding of gpl20 and viral antry. 
Soluble gpl20/CD4 complexes specifically bind to CCR5 Nt 
peptides containing sulf otyrsinces in positions 10 and 14 . 
CCR5 Nt sulfotyrcsines inhibit gpl20/CD4 binding to CCR5 as 
well as viral entry. Residues in the V3 loop and the C4 
region og gpl2 0 compose a binding site for the CCR5 amino 
terminal domain, Redisues 10-18 of the CCR5 Nt constitute a 
minimal binding domain for gpl20: sulfotyrcsines YIO and 
Y14 and negatively charged residues Dll and ElB are 
important for binding. The CCR5 Nt terninal binding site on 
gpl20 is composed mostly of residues in the V3 loop stem 
and the C4 domain. 
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What is claimed: 

1. A compound comprising the structure: 
6aYDINYYTSEPA 

wherein each T represents-" a threonine, each S 
represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide* bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carbcxy terminal 
direction; 

wherein 0 represents an amino group or an acetylated 
amino group; vjherein X i^epresents a carboxyl group or 
an ami dated carboxyl group; 

wherein all of a, Y, D, I , N, Y, Y, T, S , E and 3 are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated. 
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2. The compound of claim 1/ wherein 3 represents less 
than 17 amino acids. 

3. The compound of claim 1; wherein p represents less 
5 than 16 amino acids. 

4. The compound of claim 1, wherein 3 represents less 
than 15 amino acids. 

10 5. The compound of claim 1, wherein 3 represents less 
than 14 amino acids. 

6. The comipound of claim 1, wherein 3 represents less 
than 13 amino acids . 

15 

7. The compound of claim 1, wherein 3 represents less 
than 12 amino acids, 

8. The compound of claim 1, wherein 3 represents less 
20 than 11 amino acids. 

9. The compound of claim 1, wherein 3 represents less 
than 10 amino acids. 

25 10. The compound of claim 1, wherein 3 represents less 
than 9 amino acids . 

11. The compound of claim 1, wherein 3 represents less 
than 8 amino acids. 
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12. The compound of claim 1, wherein (B represents less 
than 7 amino acids. 

13. The compound of claim 1, wherein 3 represents less 
5 than 6 amdno acids . 

14. The compound of claim 1, wherein P represents less 
than 5 amino acids . 

10 15. The compound of claim 1, wherein (3 represents less 
than 4 amino acids. 

16. The compound of claim 1, wherein P represents less 
than 3 amino acids . 

15 

17. The compound of claim 1, wherein 3 repi-eseiits less 
than 2 amino acids . 

15. The compound of claim 1; wherein 3 represents less 
20 than 1 amino acid. 

19. The compound of claim 1, wherein a represents less 
than 3 amino acids. 

25 20. The compound of claim 1, wherein a represents less 
than 8 amino acids . 

21. The compound of claim 1, wherein a represents less 
than 7 amino acids . 

30 
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The compound of claim 1, wherein a represents less 
than 6 amino acids. 

The compound of claim 1, wherein a represents less 
than 5 amino acids , 

The compound of claim 1, wherein a represents less 
than 4 amino acids . 

The compound of claim 1, wherein a represents less 
than 3 amino acids. 

The compound of claim 1, wherein a represents less 
than 2 amino acids. 

The compound of claim 1, whex'ein a represents less 
than 1 amino acid. 

A composition comprising the compound of claim 1 and a 
detectable marker attached thereto. 

The composition of claim 28, wherein the detectable 
marker is biotin. 

The composition of claim 28, wherein the detectable 
miarksr is attached at the C- terminus of the compound. 

A composition which comprises a carrier and an amount 
of the compound of claim 1 effective to inhibit 
binding of HlV-1 to a CCR5 receptor on the sux-face of 



a CD4+ cell. 



A method of inhibiting human immunodeficiency virus 
infection of a CD4+ cell which also carries a CCR5 
receptor on its surface which comprises contacting the 
CD4+ cell with an amount of the comipound of claim 1 
effective to inhibit binding of human immunodeficiency 
virus to the CCR5 receptor so as to thei-eby inhibit 
humian immunodeficiency virus infection of the CD4 + 
cell . 

The method of claim 32, wherein the CD4-t- cell is 
present in a subject and the contacting is effected by 
administering the compound to the subject. 

A method of preventing CD4+ cells of a subject from 
becoming infected with human immunodeficiency virus 
which comprises administering to the subject an amount 
of the compound of ::laim 1 effective to inhibit 
binding of human z rnunodef iciency virus to CCR5 
receptors on the surface of the CD4+ cells so as. to 
thereby prevent the subject's CD4+ cells from becoming 
infected with human immunodeficiency virus. 

A method of treating a subject whose CD4+ cells are 
infected with human imimunodef iciency virus which 
comprises administering to the subject an amount of 
the compound of claim 1 effective to inhibit binding 
of humian immunodeficiency virus to CCR5 receptors on 
the surface of the subji^ct's CD4-r cells so as to 
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thereby treat the subject. 

36. The method of any one of claims 33-35, wherein the 
compound is administered by aerosol, intravenous, oral 

5 or topical route. 

37. The method of claim 33 or 35, wherein the subject is 
infected with HIV-1 prior to administering the 
compound to the subject. 

10 

38. The method of claim 33 or 34, wherein the subject is 
not infected with HlV-1 prior to administering the 
compound to the subject. 

15 39. The method of claim 38, wherein the subject is not 
infected with, but has been . exposed to, human 
imimtunodef iciency virus . 

40. The method of any one of claims 33-35, wherein the 
20 effective amount of the compound comprises from about 

1.0 ng/kg to about 100 mg/kg body weight of. ...the 
subj ect . 

41. The method of claim 40, wherein the effective amount 
25 of the compound comprises from about 100 ng/kg to 

about 50 mg/kg body weight of the subject. 

42. The method of claim 41, wherein the effective amount 
of the comipound comprises from about 1 jug/kg to about 

30 10 mg/kg body weight of the subject. 



The method of claim 42, wherein the effective amount 
of the comipound comprises from about 100 M9/^g to 
about 1 mg/kg body weight of. the subject. 

The method of any one of claim.s 33-35; wherein the 
subject is a human being. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

(a) immobilizing the compound of claim 1 on a solid 
support ; 

(b) contacting the immobilized compound from step (a) 
with sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on 
the immobilized compound under conditions 
permitting binding of the CCR5 ligand to the 
immobilized compound so as to form a complex; 

(c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the 
agent; and 

(e) detecting whether any CCR5 ligand is displaced 
from the complex, wherein displacement of 
detectable CCR5 ligand from the complex indicates 
that the agent binds to the compound so as to 
ther-eby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

A m.ethod of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 



-123- 

comprises : 

(a) contacting the compound of claim 1 with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 

5 under conditions permitting binding of the CCR5 

ligand to the compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is 
bound to the compound in the complex; 

10 (d) contacting the complex from step (a) with the 

agent so as to displace CCR5 ligand from the 
complex; 

(e) measuring the amount of CCR5 ligand which is 
bound to the compound in the presence of the 

15 agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in 
step (c) , wherein a reduced amount measured in 
step (e) indicates that the agent binds to the 

20 compound so as to thereby identify the agent as 

one which inhibits binding of the CCR5 ligand to 
the CCR5 receptor. 

47. A method of identifying an agent which inhibits 
25 binding of a CCR5 ligand to a CCR5 receptor which 

comprises : 

(a) immobilizing the compound of claim 1 on on a 
solid support; 

(b) contacting the immobilized compound from step (a) 
30 with the agent and detectable CCR5 ligand under 



conditions permitting binding of the CCR5 ligand 
to the immobilized compound so as. to form a 
complex; 

(c) removing any unbound CCR5 ligand; 

(d) measuring the amount of detectable CCR5 ligand 
which is bound to the immobilized compound in the 
complex; 

(e) measuring the amount of detectable CCR5 ligand 
which binds to the immobilized compound in the 
absence of the agent ; 

(f) comparing the amount 'of CCR5 ligand which is 
bound to the immobilized compound in step (e) 
with the amount measured in step (d) , wherein a 
reduced amount measured in step (d) indicates 
that the agent binds to the compound or CCR5 
ligand so as to thereby identify the agent as one 
which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

The method of claim 47, wherein the amount of the 
detectable ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptox- which 
comprises : 

(a) contacting the compound of claim 1 with the agent 
and detectable CCR5 ligand under conditions 
permitting binding of the CCR5 ligand to the 
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compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable ' CCR5 ligand 
which is bound to the comj^ound in the complex; 

(d) measuring the amount of detectable CCR5 ligand 
which binds to the compound in the absence of the 
agent ; 

(e) comparing the amount of CCR5 ligand which is 
bound to the compound in step (c) with the amount 
measured in step (d) , wherein a reduced amount 
measured in step (c) - indicates that the agent 
binds to the compound or CCR5 ligand so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

The method of claim 49, wherein the amount of the 
detectable ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 



The method of any one of 
detectable CCR5 ligand is 
marker . 



claim.s 45-50, wherein the 
labeled with a detectable 



A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

a) immobilising the compound of claim 1 on a solid 
support ; 

b) contacting the immobilized compound from step a) 
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with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact/ 

c) contacting the immobilize'd compound from step a) 
5 with the known vehicle in the absence of the 

compound and measuring the binding signal 
generated by such contact; 

d) comparing the binding signal measured in step b) 
with the binding signal measured in step c) , * 

10 wherein an increased amount measured in step b) 

indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

15 53. The- method of claim 52, v^herein the solid support is a 
surface plasmon resonance sensor chip. 

54. The method of claim 52 or 53, wherein the binding 
signal is measured by surface plasmon resonance. 

20 

55. A method of obtaining a composition which comprises: 
(a) identifying a compound which inhibits binding of 

a CCR5 ligand to a CCR5 receptor according to the 
method of any one of claims 45-50 and 52; and 
25 (b) admixing the compound so identified or a homolog 

or derivative thereof with a carrier. 
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The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a complex comprising an HIV-1 
envelope glycoprotein and a CD4 -based protein. 
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57. The method of claim 56, wherein the HIV-1 envelope 
glycoprotein is gpl20, gpl40,or gplGO. 

5 58. The method of claim 56, wherein the CD4-based protein 
is soluble CD4 or CD4-IgG2. 

59. The method of any one of claims 45-50 and 52, whex^ein 
the CCR5 ligand is a chemokine . 

10 

60. The method of claim 59, ' wherein the chemokine is 
RANTES, MlP-la or MIP-1&. 

61. The method of any one of claims 45-50 and 52, wherein 
15 the CCR5 ligand is an antibody, 

62. The method of claim 61, whei-ein the antibody is 
selected from the group consisting of PAS (ATCC 
Accession No. HB-12605) , PA.IO (ATCC Accession 

20 No. 12607), PA.ll (ATCC Accession No.. HB-12608) , PA12 

(ATCC Accession No. HB-12609) . 

63. The method of claim 45 or 47, wherein the solid 
support is a microtiter plate well, a bead or surface 

25 plasmon resonance sensor chip. 

64. A compound having the structure: 

A- (aYDINYYTSEpX) „ 
wherein each T represents a threonine, each S 
30 represents a serine, each E represents a glutamic 
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acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
terminal direction; 

wherein p represents from '0 to 13 ^.mino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carbcxy terminal direction; 

wherein X represents a carbcxyl group or an amidated 
carboxyl group; 

wherein all of a, Y, D, I , N, Y, Y; T, S , E and (3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer from 1 to 8 , A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A, 

A compound having the structure : 
(eaYDINYYTSE(3).„-A 

wherein each T represents a threonine, each S 



represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I repiresents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
tex-minal direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction; 

wherein 9 represents an amino group or an acetylated 
amino group ; 

wherein all of a, Y, D, I , N, Y, Y , T, S , E and P are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer fr-om 1 to 8, A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 

The compound of claim 64 or 65, wherein the polymer is 
selected from the group consisting of a linear lysine 



polymer, a branched lysine polymer, a linear arginine 
polymer, a branched arginine polymer, polyethylene 
glycol, a linear acetylated- lysine polymer, a branched 
acetylated lysine polymer, a linear chloroacetylated 
lysine polymer and a branched chloroacetylated lysine 
polymer . 

The compound of claim 1, wherein the compound is a 
peptide which comprises consecutive amino acids having 
the sequence YDINYYTSS . 

The compound of claim 67, wherein the tyrosines at 
positions 1 and 5 of the sequence YDINYYTSE are 
sulfated. 

A compound comprising the structure: 

eaYDnnynnnEpx 

wherein each E represents a glutamic acid, and each Y 
represents a tyrosine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino ^cdds, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein p represents from 0 to 13 amino acids, with 
the pi^oviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 



(' 
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forth in SEQ ID NO : 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; 

wherein 6 represents an amino group or an acetylated 
5 amiino group; wherein A represents a carboxyl group or 

an amidated carboxyl group; 
wherein n represents any amino acid, 

wherein all of a, Y, D, n, n,y , E, n, n, E and 3 are joined 
together by peptide bonds; 
10 further provided that at least two tyrosines in the 

compound are sulfated. 

70. The compound of claim 69, wherein the compound is a 
peptide which comprises consecutive amino acids have 

15 the" sequence YDnnYIirinE . 

71. The compound of claim 70, wherein the tyrosines at 
positions 1 and 5 of the sequence YDnilYnnnE are 
sulfated . 

20 

72. A compound comprising the structure: 
BaYDINYYTSEPX 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
25 acid, each Y represents a tyrosine; each D represents 

an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 am^ino acids, with the 
proviso that if there are more than 2 amino acids, 
30 they are joined by peptide bonds in consecutive order 



and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom- in the amino terminal 
direction; 

wherein (3 represents fx'om 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive -order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; I 

wherein 0 represents an amino group- or an acetylated 
amino group; wherein X represents a carboxyl group or 
an amidated carboxyl group; 

wherein all of a , Y , D , I , N, Y , Y , T , S , E and .p are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, 
D at position 2', Y at position 5 and E at position 9 
may be replaced with a homologous amino acid. . ,. , 

The compound of claim 72, wherein any I amino acid 
residue is be replaced with a G,A,V or L amino acid 
residue . 

The compound of claim 72, wherein any N amino acid 
residue is replaced with a Q amino acid residue. 



The compound of claim 72, wherein any Y amino acid 
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residue is replaced with an F or W amino acid residue. 

76. The compound of claim 72, ,:i^?herein any T amino acid 
residue is replaced with an S amino acid residue, 

5 

77. The compound of claim 72, wherein any S is replaced 
with a T amino acid residue. 

78. The compound of claim 72, wherein any C is replaced 
10 with an M, S, T, A, G, N, or Q amino acid residue. 
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SEQUENCE LISTING 

<110> Progenies Pharmaceuticals, Inc., et al. 

<120> SULFATED CCR5 PEPTIDES FORHIV-l DsTECTION 

<130> 204S/61010-A-PCT/JPW/SHS/AX 

<140> NOT YET KNOWN 

<141> 2001-02-28 

<160> 17 

<170> PatentLi version 3.0 

i 

<210> 1 , 
<211> 352 
<212> PRT 
<213> human 
<400> 1 

Met Asp Tyr Gin Val Ser Ser Pro He Tyr Asp He Asn Tyr Tyr Thr 
15 10 15 

Ser Glu Pro Cys Gin Lys He Asn Val Lys Ghi He Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Axg Leu Lys Ser Met 
50 55 60 

Tin- Asp lie Tyi^ Leu Lexi Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Tlu- Val Pro Phe Tip Ala His Tyi" Ala Ala Ala Ghi Tip Asp Phe 
85 90 95 



r 



2 

Gly Asn Tlir Met Cys Gin Leu Leu Tlir Gly Leu Tyr Phe He Gly Phe 
100 105 110 

Phe Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

Gly Val Val Thx Ser Val He Thi" Tip Val Val Ala Val Phe Ala Ser 
145 150 155 160 

Leu Pro Gly He He Phe Tlir Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

Tar Cys Ser Ser His Phe Pro Tyr Ser Gin Tyi- Ghi Phe Tip Lys Asn 
180 185 190 

Phe Ghi Tlax" Leu Lys He Val He Leu Gly Leu Val Leu Pro Leu Leu 
195 200 205 

Val Met Valile Cys Tyr Ser Gly He Leu Lys Thr Leu Leu Arg Cys 
210 215 220 

Ai-g Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu He Phe Thr He 
225 230 235 240 

Met He Val Tyi- Phe Leu Phe Tip Ala Pro Tyr Asn He Val Leu Leu- 
245 250 255 

Leu Asn Tin' Phe Ghi Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 
260 265 270 

Asn Arg Leu Asp Gin Ala Met Gin Val Tlu Glu Thr Leu Gly Met Tlix 
275 280 285 

His Cys Cys He Asn Pro He He Tyr Ala Phe Val Gly Glu Lys Phe 
290 295 300 

Ai-g Asn Tyi- Leu Leu Val Phe Phe Gin Lys His He Ala Lys Arg Phe 
305 310 315 320 

Cys Lys Cys Cys Ser He Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 
325 330 335 



Sei- Val Tyr Tlir Arg Ser Tlir Gly Glu Gin Glu He Ser Val Gly Leu 
340 345 350 



<210> 2 
<211> 1376 
<212> DNA 
<213> human 

<400> 2 

gaattccccc aacagascca agctctccat ctagtggaca gggaagctag cagcaaacct 60 

i 

tcccttcact acaaaacttc attgcttggc caaaaagaga gttaattcaa tgtagacatc 120 
tatgtaggca attaaaaacc tattgatgta taaaacagtt tgcattcatg gagggcaact 1 SO 
aaatacattc taggacttta taaaagatca ctttttattt atgcacaggg tggaacaaga 240 
tggattatca agtgtcaagt ccaatctatg acatcaatta ttatacatcg gagccctgcc 300 
aaaaaatcaa tgtgaagcaa atcgcagccc gcctcctgcc tccgctctac tcactggtgt 360 
tcatctttgg ttttgtgggc aacatgctgg tcatcctcat cctgataaac tgcaaaaggc 420 
tgaagagcat gactgacatc tacctgctca acctggccat ctctgacctg tttttccttc 480 
ttactgtccc cttctgggct cactatgctg ccgcccagtg ggactttgga aatacaatgt 540 
gtcaactctt gacagggctc tattttatag gcttcttctc tggaatcttc ttcatcatcc 600 
tcctgacaat cgataggtac ctggctgtcg tccatgctgt gtttgcttta aaagccagga 660 
cggtcacctt tggggtggtg acaagtgtga tcacttgggt ggtggctgtg tttgcgtctc 720 
tcccaggaat catctttacc agatctcaaa aagaaggtct tcattacacc tgcagctctc 780 
attttccata cagtcagtat caattctgga agaatttcca gacattaaag atagtcatct 840 
tggggctggt cctgccgctg cttgtca.tgg tcatctgcta ctcgggaatc ctaaaaactc 900 



( 



4 

tgcttcggtg tcgaaatgag aagaagaggc acagggctgt gaggcttatc ttcaccatca 960 
tgattgttta ttttctcttc tgggctccct acaacattgt ccttctcctg aacaccttcc 1 020 
aggaattctt tggcctgaat aattgcagta gctctaacag gttggaccaa-'gctatgcagg 1080 
tgacagagac tcttgggatg acgcactgct gcatcaaccc catcatctat gcctttgtcg 1 140 
gggagaagtt cagaaactac ctcttagtct tcttccaaaa gcacattgcc aaacgcttct 1200 
gcaaatgctg VicX^XXXic cagcaagagg ctcccgagcg agcaagctca gtttacaccc 1260 
gatccactgg ggagcaggaa atatctgtgg gcttgtgaca cggactcaag tgggctggtg 1320 
acccagtcag agttgtgcac atggcttagt tttcatacac agcctgggct gggggt 1376 

<210> 3 
<211> 49 
<212> PRT- 

<213> human inmiunodeficiency viiiis 
<400> 3 

Ai-g Gha Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
15 10 15 

Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Tlir Val Tip Gly He 
20 25 30 

Lys Gin Leu Gin Ala Ai'g He Leu Ala Val Glu Arg Tyi* Leu Lys Asp 
35 40 45 

Gin 

<210> 4 



<211> 35 



<212> PRT 

<213> human inummodeficiency vims 
<400> 4 

Tip Met Glu Tip Asp Ai-g Glu He Asn Asn Tyr Thr Ser Leu He His 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin GLn Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu 

35 , 

1 

I 

<210> 5 
<211> 22 
<212> PRT 
<213> human 

<400> 5 

Leu Leu Tlir Val Glu Gin Ala Leu Ala Asp Phe Ala Glu Leu Leu Ai* 
1 5 . 10 15 

Ala Leu Arg Aj-g Asp Leu 
20 

<210> 6 
<211> 33 
<212> PRT 



{ 



6 

<213> human 
<400> 6 

His Met Lys Gin Leu Glu Asp Lys Val Glu Glti Leu Leu Ser Lys Asn 
1 5 10 15 

Tyi- His Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu 
20 25 30 

Arg 

<210> 7 
<211> 33 
<212> PRT 
<213> hvunan 

<400> 7 

His Met Lys Gin He Glu Asp Lys He Glu Glu He Leu Ser Lys He 
1 5 10 . 15 

Tyi- His He Glu Asn Glu He Ala Arg He Lys Lys Leu He Gly Glu 
20 25 30 

Val 

<210> 8 
<211> 40 
<212> PRT 
<213> human 



7 



<400> 8 

Leu Tlir Asp Thi- Leu Glii Ala Glu Thr Asp Gin Leu Glu Asp Glu Lys 
15 10 15 

Ser Ala Leu Gin Tlir Glu He Ala Asn Leu Leu Lys Glu Lys Glu Lys 
20 25 30 

Leu Glu Phe He Leu Ala Ala Arg 
35 . 40 • 

<210> 9 

<211> 40 

<212> PRT ' 
<213> human ' 

<400> 9 

His Met Arg Ai-g He Ala Arg Leu Glu Glu Lys Val Lys Thr Leu Lys 
15 10 15 

Ala Gin Asn Ser Glu Leu Ala Ser Tlir Ala Asn Met Leu Ai-g Glu Gin 
20 25 30 

Val Ala Gin Leu Lys Gin Lys Tyr 
35 40 

<210> 10 
<211> 36 
<212> PRT 
<213> unlaiown 



<220> 

<221> PEPTIDE 
<222> (1)..(36) 
<223> T-20 

<400> 10 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Tip Ala Ser Leu 
20 25 30 

Tip Asn Tip Plie 
35 

<210> 11 
<211> 38 
<212> PRT 
<213> unlaiown 

<220> 

<221> PEPTIDE 
<222> (1)..(38) 
<223> DP 107 
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<400> 11 

Asn Asn L&u Lexi Aj-g Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 - 

Till- Val Tip Gly He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 

<210> 12 
<211> 34 

<212> PRT ■ 

<213> milcnown ' 

<220> 

<221> PEPTIDE 
<222> (1)..(34) 
<223> N34 



<400> 12 

Ser Gly He Val Glii Glri Gin Asn Asn Leu Leu Ai-g Ala He Glu Ala 
15 10 15 

Ghi Gill His Leu Leu Gin Leu Tin* Val Tip Gly He Lys Gin Leu Gin 
20 25 30 

Ala Ajg 



10 

<210> 13 
<211> 28 
<212> PRT 
<213> uiilaiown 

<220> 

<221> PEPTIDE 
<222> Cl)--(28) 
<223> CIS 

<400> 13 

Tip Met Glu Tip Asp Axg Glu He Asn Asn Tyr Thr Ser Leu He His 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys 
20 25 

<210> 14 

<211> 6S 

<212> PRT 

<213> unlaiown 

<220> 

<221> PEPTIDE 
<222> C1)--(6S) 



11 

<223> N34(L6)C2S 
<400> 14 

Ser Gly He Val Gin Ghi Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu TIxl* Val Tip Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Ai-g Ser Gly Gly Arg Gly Gly Tip Met Glu Trp Asp Aj-g Glu He 
35 40 45 

Asn Asn Tyi* Tin- Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn 
50 55 60 

Ghi Gin Glu Lys 

65 ' 
<210> 15 
<211> 39 
<212> PRT 
<213> unlmown 

<220> 

<221> PEPTIDE 
<222> (1)..(39) 
<223> T1249 

<400> 15 

Tip Gin Glu Tip Glu Gin Lys He Tlxr Ala Leu Leu Glu GLii Ala Gin 
15 10 15 



12 

lie Gin Gill Glu Lys Asn Glu T>t Glu Leu Gin Lys Leu Asp Lys Trp 
20 25 30 

Ala Ser Leu Tip Glu Trp Phe 
35 

<210> 16 
<211> 502 
<212> PRT 

<213> human iminunodeficiency vii-us 



<400> 16 

I 

Met Ai*g Val Lys Gly lie Arg Lys Ser Tyr Glii Tyr Leu Trp Lys Gly 
15 10 15 ^ 

Gly Tin- Leu Leu Leu Gly He Leu Met He Cys Ser Ala Val Glu Lys 
20 25 30 

Leu Trp Val Tin* Val Tyr Tyr Gly Val Pro Val Tip Lys Glu Ala Thr 
35 40 45 

Tlu- Tl-u- Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Tlir Glu Val 
50 55 60 

His Asn Val Tip Ala Tlii' His Ala Cys Val Pro Tlir Asp Pro Asn Pro 
65 70 75 SO 

Gin Glu Val Val Leu Glu Asn Val Tin- Glu His Phe Asn Met Tip Lys 
85 90 95 

Asn Asn Met Val Glu Ghi Met Gin Glu Asp He He Ser Leu Tip Asp 
100 105 110 

Ghi Ser Leu Lys Pro Cys Val Lys Leu Tin* Pro Leu Cys Val Tlir Leu 
115 120 125 

Asn Cys Lys Asp Val Asn Ala Tlu Asn Tlu: Tlu: Asn Asp Ser Glu Gly 
130 135 140 



( 
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Till- Met Glu Ai-g Gly Glu He Lys Asn Cys Ser Phe Asn He Thi- Tlu 
145 150 155 160 

Ser He Aig Asp Glu Val Gin Lys Glu Tyr Ala Leu-Phe Tyr Lys Leu 
165 170 175 

Asp Val Val Pro He Asp Asn Asn Asn Tlir Ser Tyr Ai-g Leu He Ser 
180 1S5 190 

Cys Asp Tlxr Ser Val He Tlir Gin Ala Cys Pro Lys He Ser Phe Glu 
195 200 205 

Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys 
210 215 220 

Cys Asn Asp Lys Tlir Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser 
225 230 235 240 

Till- Val GliT Cys Tlu His Gly He Ai-g Pro Val Val Ser Tlir Ghi Leu 
245 250 255 

Leu Leu Asri Gly Ser Leu Ala Glu Glu Glu Val Val He Ai'g Ser Asp 
260 265 270 

Asn Phe Tlir Asn Asn Ala Lys Tin* He He Val Gin Leu Lys Glu Ser 
275 280 285 

Val Glu lie Asn Cys Tin- Ai-g Pro Asn Asn Asn Thr Ai-g Lys Ser He 
290 295 300 

His He Gly Pro Gly Arg Ala Phe Tyr Tlir Tlir Gly Glu He He Gly 
305 ' 310 315 320 

Asp He Arg Gin Ala His Cys Asn He Ser Ai-g Ala Lys Trp Asn Asp 
325 330 335 

Tlir Leu Lys Ghi He Val He Lys Leu Aig Glu Gin Phe Glu Asn Lys 
340 345 350 

Till- He Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu He Val Met 
355 360 365 

His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thi* Ghi 
370 375 3S0 
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Leu Phe Asn Ser Thx Trp Asn Asn Asn Thi" Glu Gly Ser Asn Asn Tlii- 
385 390 395 400 

Ghi Gly Asn Thx He Tin* Leu Pro Cys Arg He Lys Glii He He Asn 
405 410 415 

Met Trp Gin Glu Val Gly Lys Ala Met Tyi- Ala Pro Pro He Aj-g Gly 
420 425 430 

Gin He Axg Cys Ser Ser Asn He Tin* Gly Leu Leu Leu Tlu Arg Asp 
435 440 445 

Gly Gly He Asn Glu Asn Gly Tlir Glu He Phe Arg Pro Gly Gly Gly 
450 455 460 

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys \[al Val 
465 470 475 480 

Lys He Glu Pro Leu Gly Val Ala Pro Tlir Lys Ala Lys Ai'g Arg* Val 
485 490 495 

Val Gin Ai'g Glu Lys Arg 
500 



<210> 17 
<211> 511 * 
<212> PRT 

<213> human inraiunodeficiency virus 



<400> 17 

Met Arg Val Lys Glu Lys Tyr Gin Kis Leu Trp Arg Trp Gly Trp Ai'g 
1^5 10 15 

Trp Gly Tin- Met Leu Leu Gly Met Leu Met He Cys Ser Ala Tin Glu 
20 25 30 

Lys Leu Tip Val Tin Val Tyi' Tyi" Gly Val Pro Val Trp Lys Glu Ala 
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35 40 45 

Thx Tlu Tin- Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Tlir Glu 
50 55 60 

Val ffis Asn Val Tip Ala Thj" His Ala Cys Val Pro Tlir Asp Pro Asn 
65 70 75 SO 

Pro Ghi Glu Val Val Leu Val Asn Val Tin- Glu Asn Phe Asn Met Trp 
85 90 95 

Lys Asn Asp Met Val Glu Gin Met His Glu Asp He lie Ser Leu Trp 
100 105 110 

Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Tlir Pro Leu Cys Val Ser 
115 120 125 

Leu Lys Cys Tlir. Asp Leu Lys Asn Asp Tin- Asn Thi- Asn Ser Ser Ser 
130 135 140 

Gly Ai-g Met He Met Glu Lys Gly Ghi lie Lys Asn Cys Ser Phe Asn 
145 ^ ■ 150 155 160 

He Ser Thr Ser He Arg Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe 
165 170 175 

Tyr Lys Leu Asp He He Pro He Asp Asn Asp Tlir Thr Ser Tyr Lys 
180 185 190 

Leu Tlu- Ser Cys Asn Thr Ser Val lie Tin- Ghi Ala Cys Pro Lys Val 
195 200 205 

Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala 
210 215 220 

He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Tlir 
225 230 235 240 

Asn Val Ser Thr Val Ghi Cys Tin- His Gly He Ai-g Pro Val Val Ser 
245 250 255 

Tin- Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 
260 265 270 
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Ai-g Ser Val Asn Phe Thr Asp Asn Ala Lys Tlir He He Val Gin Leu 
^ 275 2S0 285 

Asn Tin- Ser Val Gin He Asn Cys Tlir Axg Pro Asn Asn Asn Thr Arg 
290 295 300 

Lys Ai-g He Ar-g He Gin Ai-g Gly Pro Gly Ai-g Ala Phe Val Tin- He 
305 ^ 310 315 320 

Gly Lys He Gly Asn Met Ai'g Gin Ala His Cys Asn He Ser Ai'g Ala 
325 330 335 

Lys Tip Asn .Asn Thr Leu Lys Gin He Ala Ser. Lys Leu Axg Glu Ghi 
340 345 350 

Phe Gly Asn Asn Lys Thr He He Phe Lys Gin Ser Ser Gly Gly Asp 
3"55 360 365 

Pro Glu He Val Tin- His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
370 375 380 

Cys Asn Ser Tlir Gin Leu Phe Asn Ser Tin: Tip Phe Asn Ser Tin" Trp 
385 390 395 400 

Ser Tin- Glu Gly Ser Asn Asn Tin* Glu Gly Ser Asp Tin: He Tin- Leu 
405 410 415 

Pro Cys Arg He Lys Gin He He Asn Met Tip Gin Lys Val Gly Lys 
420 425 430 

Ala Met Tyi- Ala Pro Pro He Ser Gly Gin He Ai-g Cys Ser Ser Asn 
435 440 445 

He Tin- Gly Leu Leu Leu Tin Ai'g Asp Gly Gly Asn Ser Asn Asn Glu 
450 455 460 

Ser Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Tip Arg 
465 470 475 480 

Ser Glu Leu Tyr Lys Tyi* Lys Val Val Lys He Glu Pro Leu Gly Val 
485 " 490 495 

Ala Pro Tin- Lys Ala Lys Ai-g Arg Val Val Gin Arg Glu Lys Arg 
500 505 510 



